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Integrated x86 Solution with MMX Support 


Introduction 


¢ High Performance 
- Processor speeds up to 300MHz 
- Write-Back cache 
- Memory management with Load Store and 
Memory-Read Bypassing 
- Six-stage integer pipeline 
- XpressRAM™ and XpressGRAPHICS™ 


@ MediaGX™ MMX™-Enhanced Processor 
- Processor Integrated Functions: 
- Graphics Pipeline 
- Memory Controller (SDRAM) 
- Display Controller 
- PCI Controller 
- Interfaces with Cx5520 or Cx5530 I/O 
Companion chip 
- 320 SPGA or 352 BGA package 


¢@ x86 Instruction Set with MMX Support 
- Compatible with MMX Technology 
- Runs Windows®95, Windows 3.x, Windows 
NT, DOS, UNIX®, OS/2®, Solaris®, and others 


The MediaGX™ MMX™-Enhanced Processor, in 
combination with the Cx5520 or Cx5530 I/O 
Companion chip provides advanced video and 
audio functions and permits direct interface to 
memory. This high-performance 64-bit processor is 
x86 instruction set compatible and supports MMX 
technology. 


This processor is the latest member of the Cyrix 
MediaGX family, offering high performance, fully 
accelerated 2D graphics, a synchronous memory 
interface and a PCI bus controller, all on a single 
chip. As described in separate manuals, the 
Cx5520 and Cx5530 I/O Companion chips enable 
the full features of the MediaGX processor with 
MMxX support. These features include full VGA and 
VESA video, 16-bit stereo sound, IDE interface, 
ISA interface, SMM power management, and AT 
compatibility logic. In addition, the newer Cx5530 
provides an Ultra DMA/33 interface, MPEG2 
assist, and is AC97 Version 2.0 compliant audio. 


Write-Back Memory Integer Floating 
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sd e od 
Integrated Graphics Memory Display PCI 
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1 Overview 


The Cyrix MediaGX™ MMX™-Enhanced 
Processor is the latest member of the Cyrix 
MediaGX processor family. It is an advanced 64-bit 
x86 compatible processor offering high perfor- 
mance, fully accelerated 2D graphics, a 64-bit 
synchronous DRAM controller and a PCI bus 
controller, all on a single chip. Plus it is compatible 
with MMX™ technology. This latest generation of 
the MediaGX processor enables a new class of low 
cost, premium performance notebook/desktop 
computer designs. 


The MediaGX processor core is a proven design 
that offers competitive CPU performance. It has 
integer and floating point execution units that are 
based on sixth-generation technology. The integer 
core contains a single, six-stage execution pipeline 
and offers advanced features such as operand 
forwarding, branch target buffers, and extensive 
write buffering. A 16KB write-back L1 cache is 
accessed in a unique fashion that eliminates pipe- 
line stalls to fetch operands that hit in the cache. 


In addition to the advanced CPU features, the 
MediaGX processor integrates a host of functions 
which are typically implemented with external 
components. A full-function graphics accelerator 
provides pixel processing and rendering functions. 


A separate on-chip video buffer enables >30FPS 
MPEG1 video playback when used together with 
either the Cx5520™ or Cx5530™ I/O Companion 
chip. Graphics and system memory accesses are 
supported by a tightly-coupled synchronous DRAM 
(SDRAM) memory controller. This tightly coupled 
memory subsystem eliminates the need for an 
external L2 cache. 


Integrated x86 Solution with MMX Support 


=< 


The MediaGxX processor includes Cyrix’s Virtual 
System Architecture™ (VSA™) enabling Xpress- 
GRAPHICS™ and XpressAUDIO™ as well as 
generic emulation capabilities. Software handler 
routines for XpressGRAPHICS and XpressAUDIO 
are included in the BIOS and provide compatible 
VGA and 16-bit industry standard audio emulation. 
XpressAUDIO technology eliminates much of the 
hardware traditionally associated with audio func- 
tions. 


General Features 
¢ Packaged in: 
- 352-Terminal Ball Grid Array (BGA) or 
- 320-Pin Staggered Pin Grid Array (SPGA) 


* 0.35-micron four layer metal CMOS process 


* Split rail design (3.3V I/O and 2.9V core) 


64-Bit x86 Processor 


* Supports the MMX"™ instruction set extension 
for the acceleration of multimedia applications 


* Speeds offered up to 300MHz 
¢ 16KB unified L1 cache 
« Integrated Floating Point Unit (FPU) 


* Re-entrant System Management Mode (SMM) 
enhanced for the Cyrix Virtual System Architec- 
ture 


GXm_db_v2.0 


Cyrix Corporation Confidential 


Page 1 


‘Media(OX 


M M X"*- E N H AN € E D 


PCI Coniroller 
¢ Fixed, rotating, hybrid, or ping-pong arbitration 


¢ Supports up to three PCI bus masters 
* Synchronous CPU and PCI bus clock frequency 


¢ Supports concurrency between PCI master and 
L1 cache 


Power Management 
¢ Designed to support Cx5520/Cx5530 power 
management architecture 


* CPU only Suspend or full 3V Suspend 
supported: 
- Clocks to CPU core stopped for CPU 
Suspend 
- All on-chip clocks stopped for 3V Suspend 
- Suspend refresh supported for 3V Suspend 


Virtual Systems Architecture™ 


¢ New architecture allowing OS independent (soft- 
ware) virtualization of hardware functions 


« Provides compatible high performance legacy 
VGA core functionality 


Note: GUI (Graphical User Interface) graphics 
acceleration is pure hardware. 


* Provides Cyrix’s 16-bit XoressAUDIO™ 


2D Graphics Accelerator 


* Graphics pipeline performance significantly 
increased over previous generations by pipe- 
lining burst reads/writes 


* Accelerates BitBLTs, line draw, text 
¢ Supports all 256 raster operations 
¢ Supports transparent BLTs 


« Runs at core clock frequency 


* Full VGA and VESA mode support 


* Special "Driver level” instructions utilize internal 
scratchpad for enhanced performance 


Display Controller 
« Video Generator (VG) improves memory effi- 
ciency for display refresh with SDRAM 


¢ Supports a separate MPEG1 video buffer and 
data path to enable video acceleration in the 
Cx5520 


* Supports a separate MPEG2 video buffer and 
data path to enable video acceleration in the 
Cx5530 


¢ Internal palette RAM for use with the 
Cx5520/Cx5530 


¢ Direct interface to Cx5520/Cx5530 for CRT and 
TFT flat panel support which eliminates need for 
external RAMDAC 


¢ Hardware frame buffer compressor/decom- 
pressor 


¢ Hardware cursor 


* Supports up to 1280x1024x8 BPP and 
1024x768x16 BPP 


XpressRAM™ Memory Subsystem 
* Memory control/interface directly from CPU 


* 64-Bit wide memory bus 


* SDRAM bus operating frequency range of 66 to 
100MHz 


* Support for: 
- Two 168-pin unbuffered DIMMs 
- Upto 16 open banks simultaneously 
- Single or 16-byte reads (burst length of two) 


« LVTTL technology compatible 
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1.1 Architecture 


The Cyrix MediaGX MMX-Enhanced Processor 
represents a new generation of x86-compatible 64- 
bit microprocessors with sixth-generation features. 
The decoupled load/store unit (within the memory 
management unit) allows multiple instructions in a 
single clock cycle. Other features include single- 
cycle execution, single-cycle instruction decode, 
16KB write-back cache, and clock rates up to 
300MHz. These features are made possible by the 
use of advanced-process technologies and super- 


pipelining. 


The MediaGX processor has low power consump- 
tion at all clock frequencies. Where additional 
power savings are required, designers can make 
use of Suspend mode, Stop Clock capability, and 
System Management Mode (SMM). 


The MediaGxX processor is divided into major func- 
tional blocks (as shown in Figure 1-1): 

¢ Integer Unit 

¢ Floating Point Unit (FPU) 

« Write-Back Cache Unit 

« Memory Management Unit (MMU) 

¢ Internal Bus Interface Unit 

¢ Integrated Functions 


Instructions are executed in the integer unit and in 
the floating point unit. The cache unit stores the 
most recently used data and instructions and 
provides fast access to this information for the 
integer and floating point units. 


___ | Write-Back Integer 
Cache Unit MMU Unit FPU 
C-Bus | | 
e 
Internal Bus Interface Unit 
X-Bus 
® ® 
Graphi M Displ PCI 
raphics emory isplay 
Baie Pipeline Controller Controller Controller 
SDRAM Port Cx5520/Cx5530 PCI Bus 
(CRT/LCD TFT) 
Figure 1-1 Internal Block Diagram 
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1.1.1 Integer Unit 

The integer unit consists of: 

¢ Instruction Buffer 

¢ Instruction Fetch 

¢ Instruction Decoder and Execution 


The superpipelined integer unit fetches, decodes, 
and executes x86 instructions through the use of a 
six-stage integer pipeline. 


The instruction fetch pipeline stage generates, 
from the on-chip cache, a continuous high-speed 
instruction stream for use by the processor. Up to 
128 bits of code are read during a single clock 
cycle. 


Branch prediction logic within the prefetch unit 
generates a predicted target address for uncondi- 
tional or conditional branch instructions. When a 
branch instruction is detected, the instruction fetch 
stage starts loading instructions at the predicted 
address within a single clock cycle. Up to 48 bytes 
of code are queued prior to the instruction decode 
stage. 


The instruction decode stage evaluates the code 
stream provided by the instruction fetch stage and 
determines the number of bytes in each instruction 
and the instruction type. Instructions are processed 
and decoded at a maximum rate of one instruction 
per clock. 


The address calculation function is super-pipelined 
and contains two stages, AC1 and AC2. If the 
instruction refers to a memory operand, AC1 calcu- 
lates a linear memory address for the instruction. 


The AC2 stage performs any required memory 
management functions, cache accesses, and 
register file accesses. If a floating point instruction 
is detected by AC2, the instruction is sent to the 
floating point unit for processing. 


The execution stage, under control of microcode, 
executes instructions using the operands provided 
by the address calculation stage. 


Write-back, the last stage of the integer unit, 
updates the register file within the integer unit or 
writes to the load/store unit within the memory 
management unit. 


1.1.2 Floating Point Unit 


The FPU (Floating Point Unit) interfaces to the 
integer unit and the cache unit through a 64-bit 
bus. The FPU is x87-instruction-set compatible and 
adheres to the IEEE-754 standard. Because 
almost all applications that contain FPU instruc- 
tions also contain integer instructions, the 
MediaGX processor’s FPU achieves high perfor- 
mance by completing integer and FPU operations 
in parallel. 


FPU instructions are dispatched to the pipeline 
within the integer unit. The address calculation 
stage of the pipeline checks for memory manage- 
ment exceptions and accesses memory operands 
for use by the FPU. Once the instructions and 
operands have been provided to the FPU, the FPU 
completes instruction execution independently of 
the integer unit. 


1.1.3 Write-Back Cache Unit 


The 16KB write-back unified cache is a 
data/instruction cache and is configured as four- 
way set associative. The cache stores up to 16KB 
of code and data in 1024 cache lines. 


The MediaGX processor provides the ability to allo- 
cate a portion of the L1 cache as a scratchpad, 
which is used to accelerate the Virtual Systems 
Architecture algorithms as well as for some 
graphics operations. 
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1.1.4 Memory Management Unit 


The memory management unit (MMU) translates 
the linear address supplied by the integer unit into 
a physical address to be used by the cache unit 
and the internal bus interface unit. Memory 
management procedures are x86-compatible, 
adhering to standard paging mechanisms. 


The MMU also contains a load/store unit that is 
responsible for scheduling cache and external 
memory accesses. The load/store unit incorpo- 
rates two performance-enhancing features: 


¢ Load-store reordering that gives priority to 
memory reads required by the integer unit over 
writes to external memory. 


¢ Memory-read bypassing that eliminates 
unnecessary memory reads by using valid data 
from the execution unit. 


1.1.5 Internal Bus Interface Unit 


The internal bus interface unit provides a bridge 
from the MediaGX processor to the integrated 
system functions (i.e., memory subsystem, display 
controller, graphics pipeline) and the PCI bus inter- 
face. 


When external memory access is required, the 
physical address is calculated by the memory 
management unit and then passed to the internal 
bus interface unit, which translates the cycle to an 
X-Bus cycle (the X-Bus is a Cyrix proprietary 
internal bus which provides a common interface for 
all of the system modules). The X-Bus memory 
cycle now is arbitrated between other pending X- 
Bus memory requests to the SDRAM controller 
before completing. 


In addition, the internal bus interface unit provides 
configuration control for up to 20 different regions 
within system memory with separate controls for 
read access, write access, cacheability, and PCI 
access. 


1.2 Integrated Functions 


The MediaGX processor integrates the following 
functions traditionally implemented using external 
devices: 


¢ High-performance 2D graphics accelerator 


* Separate CRT and TFT data paths from the 
display controller 


* SDRAM memory controller 
* PCl bridge 


The processor has also been enhanced to support 
Cyrix’s proprietary Virtual System Architecture 
(VSA) implementation. 


The MediaGX processor implements a Unified 
Memory Architecture (UMA). By using Cyrix’s 
Display Compression Technology™ (DCT), the 
performance degradation inherent in traditional 
UMA systems is eliminated. 


1.2.1 Graphics Accelerator 


The graphics accelerator is a full-featured GUI 
(Graphical User Interface) accelerator. The 
graphics pipeline implements a bitBLT engine for 
frame buffer bitBLTs and rectangular fills. Addi- 
tional instructions in the integer unit may be 
processed, as the bitBLT engine assists the CPU in 
the bitBLT operations that take place between 
system memory and the frame buffer. This combi- 
nation of hardware and software is used by the 
display driver to provide very fast transfers in both 
directions between system memory and the frame 
buffer. The bitBLT engine also draws randomly- 
oriented vectors, and scanlines for polygon fill. All 
of the pipeline operations described in the following 
list can be applied to any bitBLT operation. 
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¢ Pattern Memory. Render with 8x8 dither, 8x8 
monochrome, or 8x1 color pattern. 


* Color Expansion. Expand monochrome 
bitmaps to full-depth 8- or 16-bit colors. 


* Transparency. Suppresses drawing of back- 
ground pixels for transparent text. 


« Raster Operations. Boolean operation 
combines source, destination, and pattern 
bitmaps. 


1.2.2 Display Controller 


The display port is a direct interface to the 
Cx5520/Cx5530 which drives a TFT flat panel 
display, LCD panel, or a CRT display. 


The display controller (video generator) retrieves 
image data from the frame buffer region of 
memory, performs a color-look-up if required, 
inserts the cursor overlay into the pixel stream, 
generates display timing, and formats the pixel 
data for output to a variety of display devices. The 
display controller contains Display Compression 
Technology (DCT) that allows the MediaGX 
processor to refresh the display from a 
compressed copy of the frame buffer. DCT typically 
decreases the screen-refresh bandwidth require- 
ment by a factor of 15 to 20, further minimizing 
bandwidth contention. 


1.2.3 XpressRAM™ Memory 


Subsystem 


The memory controller drives a 64-bit SDRAM port 
directly. The SDRAM memory array contains both 
the main system memory and the graphics frame 
buffer. Up to four module banks of SDRAM are 
supported. Each module bank will have two or four 
component banks depending on the memory size 
and organization. The maximum configuration is 
four module banks with four component banks 
providing a total of 16 open banks. The maximum 
memory size is 1GB. 


The memory controller handles multiple requests 
for memory data from the MediaGX processor, the 
graphics accelerator and the display controller. The 
memory controller contains extensive buffering 
logic that helps minimize contention for memory 
bandwidth between graphics and CPU requests. 
The memory controller cooperates with the internal 
bus controller to determine the cacheability of all 
memory references. 


1.2.4 PCI Controller 


The MediaGX processor incorporates a full-func- 
tion PCI interface module that includes the PCI 
arbiter. All accesses to external I/O devices are 
sent over the PCI bus, although most memory 
accesses are serviced by the SDRAM controller. 
The Internal Bus Interface Unit contains address 
mapping logic that determines if memory accesses 
are targeted for the SDRAM or for the PCI bus. 
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1.3 System Designs include full VGA and VESA video, 16-bit stereo 


The Cyrix MediaGX™ Integrated Subsystem with sound, IDE interface, ISA interface, SMM power 
MMX™ support consists of two chips, the management, and AT compatibility logic. In addi- 
MediaGX MMX-Enhanced Processor and the tion, the newer Cx5530 provides an Ultra DMA/33 
Cx5520™ or Cx5530™ I/O Companion. The interface, MPEG2 assist, and AC97 Version 2.0 


subsystem provides high performance using 64-bit Compliant audio. 


x86 processing. The two chips integrate video, Figure 1-2 shows a basic block system diagram 
audio and memory interface functions normally (refer to Figure 2-4 on page 34 for detailed 
performed by external hardware. subsystem interconnection signals). It includes the 


Cyrix Cx9210™ Dual-Scan Flat Panel Display 
Controller for designs that need to interface to a 
DSTN panel (instead of TFT panel). 


As described in separate manuals, the Cx5520 and 
Cx5530 enable the full features of the MediaGX 
processor with MMX support. These features 


MD[63:0] SDRAM YUV Port 
Port (Video) 
SDRAM MediaGxX™ 
MMX™-Enhanced 
Processor 


Clocks ; 
Serial 
Packet RGB Port 
(Graphics) 
CRT 
USB PCI Interface 
(2 Ports) |“ System i 
ate PCI Bus 
TFT 
Speakers + —_—_ | Panel 
Graphics Data 
Video Data 
Analog RGB 
Bee AC97 Cx55x0™ ae 
| CopEC 1/0 Companion Digital RGB (to TFT or DSTN Panel) 
Audio 
IDE Control 
: Cx9210™ 
Micro- 
phone Super IDE DSTN 
14.31818 /O BIOS Devices Controller 
GPIO <—— MHz Crystal 


ISA Bus 


DC-DC & Battery 


DSTN Panel 


Note: Dashed lines denote Cx5520 application. 


Figure 1-2 System Block Diagram 
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The Cx9210 converts the digital RGB output of a scan flat panel LCD. It can drive all standard dual- 
Cx5520 or Cx5530 I/O Companion chip to the scan color STN flat panels up to 1024x768 resolu- 
digital output suitable for driving a dual-scan color tion. Figure 1-3 shows an example of a Cx9210 
STN (DSTN) flat panel LCD. It connects to the interface in a typical MediaGX Integrated 

digital RGB output of a MediaGX™ processor or Subsystem. 


Cx55x0 and drives the graphics data onto a dual- 


Pixel Data 18 


Pixel Port DRAM A 
Cx55x0™ Cx9210™ jg DRAM Data 88 | 256K 16 
vO DSTN 


Companion Controller 
Control 


MediaGX™ 
Processor 


Figure 1-3 Cx9210 Interface System Diagram 
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2 Signal Definitions 


This section describes the external interface of the 
MediaGX processor. Figure 2-1 shows the signals 


System 
Interface 
Signals 


PCI 
Interface 
Signals 


SYSCLK 
CLKMODE[2:0] 
RESET 

INTR 

IRQ13 

SMI# 

SUSP# 
SUSPA# 
SERIALP 


AD[31:0] 
C/BE[3:0}# 
PAR 
FRAME# 
IRDY# 
TRDY# 
STOP# 
LOCK# 
DEVSEL# 
PERR# 
SERR# 
REQ[2:0]# 
GNT[2:0]# 


MediaGX™ 
MMX™-Enhanced 


Processor 


Figure 2-1 Functional Block Diagram 


MD[63:0] 
MA[12:0] 
BA[1:0] 
RASA#, RASB# 
CASA#, CASB# 
CS[3:0]# 
WEA#, WEB# 
DQMI7:0] 
CKEA, CKEB 
SDCLK[3:0] 
SDCLK_IN 
SDCLK_OUT 
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CRT_VSYNC 
FP_HSYNC 
FP_VSYNG 
ENA_DISP 
VID_RDY 
VID_VAL 
VID_DATAI7:0] 
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organized by their functional interface groups 
(internal test and electrical pins are not shown). 


Memory 
Controller 
Interface 
Signals 


Video 
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2.1 Pin Assignments 


The MediaGX MMX-Enhanced processor is avail- 
able in two packages, a 352 BGA package and a 
320 SPGA package. 


The pin assignment for the 352 BGA is shown in 
Figure 2-2. Tables 2-2 and 2-3 are pin assignment 
lists for the 352 BGA sorted by pin number and 
alphabetically by signal name, respectively. 


The 320 SPGA pin assignment is shown in Figure 

2-3. Tables 2-4 and 2-5 are pin assignment lists for 
the 320 SPGA sorted by pin number and alphabet- 
ically by signal name, respectively. 


Abbreviations used in Tables 2-4 through 2-5 are 
shown in Table 2-1. 


Section 2.2 on page 21 describes the signals which 
are grouped according their functional group. 


Table 2-1 


Pin Type Definitions 


Mnemonic 


Definition 


Standard input pin. 


Bidirectional pin. 


Totem-pole output. 


Open-drain output structure that 
allows multiple devices to share the 
pin in a wired-OR configuration 


Pull-up resistor 


Pull-down resistor 


s/t/s 


Sustained tristate, an active-low 
tristate signal owned and driven by 
one and only one agent at a time. 
The agent that drives an s/t/s pin low 
must drive it high for at least one 
clock before letting it float. A new 
agent cannot start driving an s/t/s 
signal any sooner than one clock 
after the previous owner lets it float. 
A pull-up resistor is required to sus- 
tain the inactive state until another 
agent drives it, and must be pro- 
vided by the central resource. 


VCC (PWR) 


Power pin. 


VSS (GND) 


Ground pin 


# 


The "#" symbol at the end of a signal 
name indicates that the active, or 
asserted state occurs when the sig- 
nal is at a low voltage level. When 
"#" is not present after the signal 
name, the signal is asserted when at 
a high voltage level. 
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Note: Signal names have been abbreviated in this figure due to space constraints. 
@ = GND terminal 
© = PWR terminal (VCC2 = VCC_CORE; VCC3 = VCC_IO) 


Figure 2-2 352 BGA Pin Assignment Diagram 
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Table 2-2 352 BGA Pin Assignments - Sorted by Pin Number 


Pin Pin Pin Pin Pin 
No. | Signal Name No. | Signal Name No. | Signal Name No. | Signal Name No. | Signal Name 
A1|VSS B15|AD9 D3 | REQ2# G1/}VCC3 M1 | CLKMODE2 
A2|VSS B16] AD7 D4|VSS G2 | VCC3 M2 | VID_VAL 
A3 | AD27 B17| VCC2 D5 | C/BE3# G3} VCC3 M3 | CLKMODEO 
A4| AD24 B18] INTR D6| VSS G4/}VCC3 M4/VSS 
A5|AD21 B19] AD3 D7 | VCC2 G23 | VCC3 M23] VSS 
A6|AD16 B20 | VCC3 D8|VSS G24|VCC3 M24 | MD44 
A7|VCC2 B21|TEST1 D9| VSS G25 | VCC3 M25) MD13 
A8 | FRAME# B22|TEST3 D10| VCC3 G26 | VCC3 M26 | MD45 
AQ | DEVSEL# B23|MD1 D11|VSS H1| TMS N1| VSS 
A10| VCC3 B24 | MD33 D12/VSS H2 | SUSP# N2| PIXEL1 
A1i1 | PERR# B25| VSS D13|VSS H3 | REQ1# N3 | PIXELO 
A12|}AD15 B26| VSS D14/VSS H4| VSS N4| VSS 
A13/VSS C1 | AD29 D15/VSS H23/VSS N23|VSS 
A14|AD11 C2|AD31 D16/VSS H24|MD8 N24 |MD14 
A15| C/BEO# C3 | AD30 D17|VCC2 H25 | MD40 N25 | MD46 
A16|AD6 C4] AD26 D18/VSS H26 | MD9 N26 |MD15 
A17|VCC2 C5 | AD23 D19;/VSS J1|FP_VSYNC P1)VID_CLK 
A18}AD4 Cé6|AD19 D20|VCC3 J2|TCLK P2| PIXEL3 
A19}AD2 C7|VCC2 D21/VSS J3 | RESET P3 | PIXEL2 
A20|VCC3 C8|AD17 D22|MDO J4/ VSS P4| VSS 
A21| ADO C9 | IRDY# D23/VSS J23| VSS P23/VSS 
A22|AD1 C10] VCC3 D24|MD4 J24 |MD41 P24 | MD47 
A23| TEST2 C11 | STOP# D25 | MD36 J25 | MD10 P25 | CASA# 
A24|MD2 C12| SERR# D26| TDN J26 | MD42 P26 | SYSCLK 
A25|VSS C13 | C/BE1# E1 | GNT2# K1 | VCC2 R1 | PIXEL4 
A26|VSS C14|AD13 E2 | SUSPA# K2| VCC2 R2 | PIXEL5 
B1|VSS C15}AD10 E3 | REQO# K3 | VCC2 R3 | PIXEL6 
B2|VSS C16 | AD8 E4)AD20 K4| VCC2 R4|VSS 
B3 | AD28 C17| VCC2 E23 | MD6 K23 | VCC2 R23|VSS 
B4| AD25 C18) AD5 E24| TDP K24| VCC2 R24 | WEB# 
B5| AD22 C19 | SMl# E25 |MD5 K25 | VCC2 R25 | WEA# 
B6|AD18 C20 | VCC3 E26 | MD37 K26 | VCC2 R26 | CASB# 
B7| VCC2 C21| TESTO F1|TDO L1|CLKMODE1 T1 | PIXEL7 
B8 | C/BE2# C22 | IRQ13 F2| GNT1# L2|FP_HSYNC T2| PIXEL8 
B9| TRDY# C23 | MD32 F3| TEST L3 | SERIALP T3 | PIXEL9 
B10] VCC3 C24 | MD34 F4|VSS L4|VSS T4/VSS 
B11 | LOCK# C25 | MD3 F23/VSS L23)VSS T23}VSS 
Bi2|PAR C26 | MD35 F24|MD38 L24]MD11 T24 | DQMO 
B13|AD14 D1 | GNTO# F25|MD7 L25 | MD43 T25 | DQM4 
B14} AD12 D2| TDI F26 | MD39 L26 |MD12 T26 | DQM1 
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Table 2-2 352 BGA Pin Assignments - Sorted by Pin Number (cont.) 


Pin Pin Pin Pin Pin 
No. | Signal Name No. | Signal Name No. | Signal Name No. | Signal Name No. | Signal Name 
U1}VCC3 Y¥26 | VCC2 AC15| VSS AD20| VCC3 AE25/VSS 
U2|VCC3 AA1 | PIXEL15 AC16| VSS AD21 | MD48 AE26/VSS 
U3 | VCC3 AA2 | PIXEL16 AC17|VCC2 AD22 | DQM3 AF1|VSS 
U4} VCC3 AA3 | CRT_VSYNC AC18| VSS AD23|CS1# AF2|VSS 
U23 | VCC3 AA4|VSS AC19| VSS AD24 | MA11 AF3 | VID_DATA1 
U24| VCC3 AA23/VSS ACG20| VCC3 AD25 | BAO AF4| SDCLKO 
U25|VCC3 AA24|MA1 AC21| VSS AD26| BA1 AF5 | SDCLK2 
U26 | VCC3 AA25 | MA2 AC22 | DQM6 AE1|VSS AF6 | MD31 
V1 | PIXEL10 AA26 | MA3 ACG23| VSS AE2|VSS AF7 | VCC2 
V2 | PIXEL11 AB1 | DCLK AC24 | MA8 AE3 | VID_DATA2 AF8 | SDCLK_OUT 
V3 | PIXEL12 AB2 | PIXEL17 AG25 | MAY AE4 | SDCLK3 AF9 | MD30 
V4|VSS AB3 | VID_DATA6 AC26 | MA10 AE5 | SDCLK1 AF10|VCC3 
V23|VSS AB4 | VID_DATA7 AD1 | VID_RDY AE6 | RW_CLK AF 11 | MD60 
V24 | DQM5 AB23 | MA4 AD2 | VID_DATA5 AE7 | VCC2 AF12|MD27 
V25 | CS2# AB24 | MA5 AD3| VID_DATA3 AE8 | SDCLK_IN AF13 | MD57 
V26 | CSO# AB25 | MA6 AD4| VID_DATAO AE9 | MD61 AF14| VSS 
W1 | PIXEL13 AB26 | MA7 AD5| ENA_DISP AE10/VCC3 AF15|MD23 
W2|CRT_HSYNC AC1|PCLK AD6 | MD63 AE11 | MD28 AF16|MD53 
W3 | PIXEL14 AC2| FLT# AD7 | VCC2 AE12|MD58 AF17 | VCC2 
W4|VSS AC3| VID_DATA4 AD8 | MD62 AE13}MD25 AF18| MD52 
W23/VSS AC4|VSS AD9 | MD29 AE14/MD24 AF19|MD19 
W24 | RASA# AC5| VOLDET AD10]| VCC3 AE15|MD54 AF20 | VCC3 
W25 | RASB# AC6| VSS AD11|MD59 AE16|MD21 AF21 | MD49 
W26 | MAO AGC7|VCC2 AD12| MD26 AE17/VCC2 AF22 |MD16 
Y1|VCC2 AC8| VSS AD13 | MD56 AE18/MD20 AF23 | DQM2 
Y2|VCC2 AC9| VSS AD14|MD55 AE19}MD50 AF24 | CKEA 
Y3| VCC2 AC10] VCC3 AD15| MD22 AE20|VCC3 AF25| VSS 
Y4|VCC2 AC11/VSS AD16| CKEB AE21|MD17 AF26| VSS 
Y23 | VCC2 AC12|VSS AD17|VCC2 AE22 | DQM7 
Y24|VCC2 AC13| VSS AD18|MD51 AE23 | CS3# 
Y25|VCC2 AC14| VSS AD19|MD18 AE24|MA12 
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Table 2-3 352 BGA Pin Assignments - Sorted Alphabetically by Signal Name 


Signal Name | Type | Pin No. Signal Name | Type | Pin No. Signal Name | Type | Pin No. Signal Name | Type | Pin No. 
ADO VO A21 CRT_HSYNG [e) w2 MD4 VO D24 MD49 VO AF21 
AD1 VO A22 CRT_VSYNC oO AA3 MD5 VO E25 MD50 VO AE19 
AD2 VO A19 CSO0# (e) V26 MD6 VO E23 MD51 VO AD18 
AD3 VO B19 CS1# oO AD23 MD7 VO F25 MD52 VO AF18 
AD4 VO A18 CS2# oO V25 MD8 VO H24 MD53 VO AF16 
AD5 VO C18 CS3# oO AE23 MD9 VO H26 MD54 VO AE15 
AD6 VO A16 DCLK | AB1 MD10 VO J25 MD55 VO AD14 
AD7 VO B16 DEVSEL# s/t/s | AQ (PU) MD11 VO L24 MD56 VO AD13 
AD8 VO C16 DQMO [e) T24 MD12 VO L26 MD57 VO AF13 
AD9 VO B15 DQM1 oO T26 MD13 VO M25 MD58 VO AE12 
AD10 VO C15 DQM2 oO AF23 MD14 VO N24 MD59 VO AD11 
AD11 VO A14 DQM3 fe) AD22 MD15 VO N26 MD60 VO AF11 
AD12 VO B14 DQM4 oO T25 MD16 VO AF22 MD61 VO AE9 
AD13 ie) C14 DQM5 (e) V24 MD17 VO AE21 MD62 VO AD8 
AD14 VO B13 DQM6 [e) AC22 MD18 ie) AD19 MD63 VO AD6 
AD15 VO A12 DQM7 (eo) AE22 MD19 ie) AF19 PAR VO B12 
AD16 VO A6 ENA_DISP oO AD5 MD20 VO AE18 PCLK oO AC1 
AD17 VO c8 FLT# | AC2 MD21 ie) AE16 PERR# s/t/s | A11 (PU) 
AD18 ie) B6 FP_HSYNC oO L2 MD22 ie) AD15 PIXELO oO N3 
AD19 ie) C6 FP_VSYNC O J1 MD23 VO AF15 PIXEL1 oO N2 
AD20 VO E4 FRAME# s/t/s | A8 (PU) MD24 ie) AE14 PIXEL2 fe) P3 
AD21 VO AS GNTO# oO D1 MD25 ie) AE13 PIXEL3 [e) P2 
AD22 VO B5 GNT1# oO F2 MD26 VO AD12 PIXEL4 oO R1 
AD23 VO C5 GNT2# [e) E1 MD27 VO AF12 PIXEL5 [e) R2 
AD24 VO A4 INTR | B18 MD28 VO AE11 PIXEL6 oO R3 
AD25 VO B4 IRDY# s/t/s | C9 (PU) MD29 VO AD9 PIXEL7 (eo) T1 
AD26 VO C4 IRQ13 [e) C22 MD30 VO AF9 PIXEL8 oO T2 
AD27 VO A3 LOCK# s/t/s | B11 (PU) MD31 ie) AF6 PIXEL9 (e) T3 
AD28 VO B3 MAO oO W26 MD32 VO C23 PIXEL10 ie) V1 
AD29 VO C1 MA1 [e) AA24 MD33 ie) B24 PIXEL11 ie) v2 
AD30 VO C3 MA2 oO AA25 MD34 VO C24 PIXEL12 ie) V3 
AD31 VO C2 MA3 [e) AA26 MD35 VO C26 PIXEL13 1e) w1 
BAO ie) AD25 MA4 [e) AB23 MD36 VO D25 PIXEL14 1e) W3 
BAI oO AD26 MA5 oO AB24 MD37 ie) E26 PIXEL15 ie) AA1 
CASA# oO P25 MA6 oO AB25 MD38 VO F24 PIXEL16 ie) AA2 
CASB# oO R26 MA7 oO AB26 MD39 VO F26 PIXEL17 ie) AB2 
C/BEO# VO A15 MA8 oO AC24 MD40 VO H25 RASA# oO W24 
C/BE1# VO C13 MAQ oO AC25 MD41 ie) J24 RASB# fe) W25 
C/BE2# VO B8 MA10 oO AC26 MD42 VO J26 REQO# I E3 (PU) 
C/BE3# VO D5 MA11 fe) AD24 MD43 VO L25 REQ1# I H3 (PU) 
CKEA oO AF24 MA12 [e) AE24 MD44 VO M24 REQ2# I D3 (PU) 
CKEB oO AD16 MDO VO D22 MD45 VO M26 RESET I J3 
CLKMODEO | M3 MD1 VO B23 MD46 VO N25 RW_CLK oO AE6 
CLKMODE1 | | MD2 VO A24 MD47 ie) P24 SDCLK_IN I AE8 
CLKMODE2 | M1 MD3 VO C25 MD48 ie) AD21 SDCLK_OUT ie) AF8 
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Signal Name | Type | Pin No. Signal Name | Type | Pin No. Signal Name | Type | Pin No. Signal Name | Type | Pin No. 
SDCLKO oO AF4 vcC2 PWR Y24 VID_DATAO oO AD4 VSS GND N1 
SDCLK1 oO AE5 VvCC2 PWR Y25 VID_DATA1 oO AF3 VSS GND N4 
SDCLK2 oO AF5 VvCC2 PWR Y26 VID_DATA2 oO AE3 VSS GND N23 
SDCLK3 oO AE4 VvCC2 PWR AC7 VID_DATA3 oO AD3 VSS GND P4 
SERIALP oO L3 VvCC2 PWR AC17 VID_DATA4 oO AC3 VSS GND P23 
SERR# OD | C12 (PU) VvCC2 PWR AD7 VID_DATA5 oO AD2 VSS GND R4 
SMl# | C19 VvCC2 PWR AD17 VID_DATA6 oO AB3 VSS GND R23 
STOP# sit/s | C11 (PU) vcCc2 PWR AE7 VID_DATA7 oO AB4 VSS GND T4 
SUSP# | H2 (PU) VvCC2 PWR AE17 VID_RDY I AD1 VSS GND T23 
SUSPA# oO E2 VvCC2 PWR AF7 VID_VAL oO M2 VSS GND V4 
SYSCLK | P26 VvCC2 PWR AF17 VOLDET oO AC5 VSS GND V23 
TCLK | J2 (PU) VCC3 PWR A10 VSS GND Al VSS GND W4 
TDI | D2 (PU) VCC3 PWR A20 VSS GND A2 VSS GND W23 
TDN oO D26 VCC3 PWR B10 VSS GND A13 VSS GND AA4 
TDO oO FA VCC3 PWR B20 VSS GND A25 VSS GND AA23 
TDP oO E24 VCC3 PWR C10 VSS GND A26 VSS GND AC4 
TEST | F3 (PD) VCC3 PWR C20 VSS GND B1 VSS GND AC6 
TESTO oO C21 VCC3 PWR D10 VSS GND B2 VSS GND AC8 
TEST1 oO B21 VCC3 PWR D20 VSS GND B25 VSS GND AC9 
TEST2 oO A23 VCC3 PWR G1 VSS GND B26 VSS GND AC11 
TEST3 oO B22 VCC3 PWR G2 VSS GND D4 VSS GND AC12 
TMS | H1 (PU) VCC3 PWR G3 VSS GND D6 VSS GND AC13 
TRDY# sit/s B9 (PU) VCC3 PWR G4 VSS GND D8 VSS GND AC14 
vCcC2 PWR A7 VCC3 PWR G23 VSS GND D9 VSS GND AC15 
VCC2 PWR Ai7 VCC3 PWR G24 VSS GND D11 VSS GND AC16 
vcc2 PWR B7 VCC3 PWR G25 VSS GND D12 VSS GND AC18 
VvcC2 PWR B17 VCC3 PWR G26 VSS GND D13 VSS GND AC19 
VvCC2 PWR C7 VCC3 PWR U1 VSS GND D14 VSS GND AC21 
VvCcC2 PWR C17 VCC3 PWR U2 VSS GND D15 VSS GND AC23 
VvCcC2 PWR D7 VCC3 PWR U3 VSS GND D16 VSS GND AE1 
VvCC2 PWR D17 VCC3 PWR U4 VSS GND D18 VSS GND AE2 
VvCC2 PWR K1 VCC3 PWR U23 VSS GND D19 VSS GND AE25 
VvCcC2 PWR K2 VCC3 PWR U24 VSS GND D21 VSS GND AE26 
vcC2 PWR K3 VCC3 PWR U25 VSS GND D23 VSS GND AF1 
VvCC2 PWR K4 VCC3 PWR U26 VSS GND F4 VSS GND AF2 
VvCcC2 PWR K23 VCC3 PWR AC10 VSS GND F23 VSS GND AF14 
VvCcC2 PWR K24 VCC3 PWR AC20 VSS GND H4 VSS GND AF25 
VvCC2 PWR K25 VCC3 PWR AD10 VSS GND H23 VSS GND AF26 
VvCC2 PWR K26 VCC3 PWR AD20 VSS GND J4 WEA# oO R25 
vCcCc2 PWR Y1 VCC3 PWR AE10 VSS GND J23 WEB# oO R24 
vcc2 PWR] Y2 VvCC3 PWR | AE20 VSS GND L4 Note: PU/PD indicates pin is 
VvCC2 PWR Y3 VCC3 PWR AF10 VSS GND L23 internally connected to 
vcC2 PWR Y4 VCC3 PWR AF20 VSS GND M4 a 20-kohm pull-up/- 
VvCC2 PwR| Y23 VID_CLK oO Pt vss GND | M23 down resistor. 


Page 15 


Cyrix . 
Media G4 Pin Assignments 


PROCESSOR 


M M X"- E N H AN € E D 


Index Corner 


12 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 


fe) 1e) e (2) e) fe) {e) {e) e e) e) 19) {e) iJ e (e) 19) 
vec3. — AD25 VSS vec2 AD16 vcc3. —- STOP# = SERR#_~—VSS ADI AD8 vcc3 AD2 vec2 VSS TSTO vcc3 


e@ O Oo O Oo ie) O Oo O 9} Oo O Oo Oo ie) Oo O ie) 
vss AD27—s« CBE3#-— AD 21 AD19 = CBE2#_=— TRDY# = LOCK#—s CBE1#_— AD43 ADS AD6 AD3 SMl# AD1 TST2 MD33 MD2 
{e) O O Oo ie) O Oo e {e) e) Oo e Oo {e) ie) Oo ie) oO (e) 
vec3 AD31 AD26 AD23 vec2 AD18  FRAME# = VSS PAR vec3 AD10 Vss AD4 ADO vec2 __IRQ13 MD1 MD34 vcc3 
e) O O O e) e) O Oo O ie) {e) O Oo O 1) oO O O 
AD30 AD29 AD24 AD22 AD20 AD{7 IRDY# = PERR# = AD14 AD12 AD7 INTR TST1 TST3 MDO MD32 MD3 MD35 
ie) e) O e ie) o) e (e) oO e ie) (e) e is) ie) e ‘e) oO O 
REQO#  REQ2#  — AD28 vss vec2 vec2 VSS DEVSEL# = AD15 vss CBEO# ADS vss vec2 vec2 vss MD4 MD36 TON 
fe) O ie) e) 
GNTO# TDI MD5 TDP 
e O e e Oo e 
VSS CLKMODE2 VSS MD37 vss 
{e) O O O 
GNT2# — SUSPA# MD6 MD38 
O e O e) e O 
TDO vss TEST vec2 vss MD7 
oO O O 1e) 
REQ(# = GNT4# MD39 MD8 
o) fo) fo) fo) io) (e) 
vec2 vec2 vec2 vec2 vec2 vec2 
ie) O O ie) 
RESET _ SUSP# MD40 MD9 
{e) Oo e e Oo ie) 
vec3 TMS VSS vss MD41 vec3 
e) O O Oo 
FPVSYN —_TCK MD 10 MD42 


s 
aan? ee Ne Med iaG XT™ He id ane 


ce) O O e) 


CKMD1 — FPHSYN M MXT™ E h MD44 MD12 
ae vl pa or = n a n Cc e d i 4 es ‘ine 


Oo O O Oo 


PIX1 . PIX2 ~ Processor Ps MD15 . MD46 Ps 


vec3 VSS VSs vec3 VSS 
ie) ie) ie) oO 
PIX3 _ VID_CLK SYSCLK  MD47 


(e) oO e) {e) O 
PIX5 PIX4 WEA# = WEBH —CASA# 


320 SPGA - Top View con get 


O e O 
VSs PIX7 Dam1 vss DaM4 
O O e) 
PIX10 cs2# — DQMS 
oO e e {e) is) AA 
vcc3 PIX'1 VSs Vss cso# — VCC3 
12) O O ie) AB 
PIXI2 ——PIX13 RASB# — RASA# 
@ {e) {e) fe) io) ie) AC 
vec2  -vcC2~—s«WCC2. vec2 -veC2_—s VCC2. 
ie) O O oO AD 
CRTHSYN  DCLK MA2 MAO 
e) e fo) iJ e AE 
PIX14 vss vec2 vec2 VSS 
O O AF 
PIXIS ——PIX16 
e e) e AG 
VSS PIX17 vss 
e) O AH 
CRTVSYN  VDAT8 MAB 
O O O e @ O e oO oO e O O e io) AJ 
PCLK FLT# = VDATS VSS vec2 — MD31 VSs MD60. ——- MD57 Vss MD22.——- MD52 VSS vec2 — vec2 VSS BA1 Mag 
ie) e e) e) e) ‘e) O eo) ie) (9) ©) O ie) ie) 1e) Ss) e AK 
VRDY VSS _ VDATO _ SDCLKO SDCLK2 SDCLKIN MD29 wp27 —-MDS6. = MOSS. M21 MD20 MD50 _ MD16 = -DQM3 _—CS3# vss 
o) e) O oO io) O e) e oO ie) oO e oO oO io) ‘e) O AL 
vec2 _ VDAT4 _ VDAT2 _SDCLKi  VCC2_ _ RWCLK SDCLKOUT VSS MD58 = =~VCC3. M23 vss MD19 MD49 — ~VCC2__——«~DQM6 ~_—sCCKEA MAtt vec3 
oO e) e) (e) oO O © © je) je) O © Oo 2) ie) O 2) AM 
VDAT7 —-VDAT3. ~—sENDIS.-—s SDCLK3. = MD63.—s MD30.-—s M61 MD59 =~ MD25. = MD24. = MD53.—Ss M51 MD18 = MD48 = DQM7-~—s-DQM2_—s MA12._—s«VOLDET. 
e {e) e) e eo) oO e@ e) oO e e) O e e) e) e e) @ e AN 
Vss vec2. ——-VDAT1 VSS vec2  MD62  VCC3 M028 MD26 Vss MD54 = CKEB.—s«VCC3 MD17—- VCC2 VSS csi# — VCC3 Vss 


N <x 2<cuHuHonwowe2ze fF Re TAA MO_O wD > 
N<xX S&S < CHHODOVZAZEHACTOHOAMI.IOB > 


> b > > >} > >> DS D> 
FzteERrerTroeRmoeoaS 


123 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 


Note: Signal names have been abbreviated in this figure due to space constraints. 


@ = Denotes GND terminal 
© = Denotes PWR terminal (VCC2 = VCC_CORE; VCC3 = VCC_IO) 


Figure 2-3 320 SPGA Pin Assignment Diagram 
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Pin Assignments 2 


Table 2-4 320 SPGA Pin Assignments - Sorted by Pin Number 


Pin Pin Pin Pin Pin 
No. |Signal Name No. | Signal Name No. | Signal Name No. | Signal Name No. | Signal Name 
A3|VCC3 c9|VCC2 E15} DEVSEL# L35 | VCC2 U35 | VCC3 
A5| AD25 C11|AD18 E17|AD15 L37 | VCC2 U37|VSS 
A7|VSS C13 | FRAME# E19|VSS M2] RESET V2 | PIXEL3 
A9|VCC2 C15/VSS E21 | C/BEO# M4 | SUSP# V4 | VID_CLK 
A11|AD16 C17|PAR E23 |AD5 M34 | MD40 V34 | SYSCLK 
A13|VCC3 C19} VCC3 E25/VSS M36 | MD9 V36 | MD47 
A15|STOP# C21 | AD10 E27 | VCC2 N1|VCC3 W1 | PIXEL6 
A17|SERR# C23) VSS E29 | VCC2 N3| TMS W3 | PIXEL5 
A19|VSS C25 | AD4 E31|VSS N5|VSS W5 | PIXEL4 
A21|AD11 C27 | ADO E33 | MD4 N33 | VSS W33 | WEA# 
A23 | AD8& C29 | VCC2 E35 | MD36 N35 | MD41 W35 | WEB# 
A25|VCC3 C31 | IRQ13 E37|TDN N37 | VCC3 W37 | CASA# 
A27 | AD2 C33 |MD1 F2 | GNTO# P2|}FP_VSYNC X2|NC 
A29|VCC2 C35 | MD34 F4| TDI P4/TCLK X4 | PIXEL9 
A31|VSS C37 | VCC3 F34|}MD5 P34 |MD10 X34 | DQMO 
A33| TESTO D2 | AD30 F36 | TDP P36 | MD42 X36 | CASB# 
A35 | VCC3 D4 | AD29 G1|VSS Q1 | SERIALP Y1 | PIXEL8 
A37|VSS D6 | AD24 G3 | CLKMODE2 Q3/VSS Y3|VSS 
B2|VSS D8 | AD22 G5|VSS Q5|NC Y5 | PIXEL7 
B4|AD27 D10}AD20 G33) VSS Q33 |MD11 Y33 | DQM1 
B6 | C/BE3# D12|AD17 G35 | MD37 Q35|VSS Y35|VSS 
B8}AD21 D14|IRDY# G37|VSS Q37 | MD43 Y37 | DQM4 
B10|AD19 D16 | PERR# H2 | GNT2# R2 | CLKMODE1 Z2|NC 
B12 | C/BE2# D18}AD14 H4 | SUSPA# R4 | FP_HSYNG Z4 | PIXEL10 
B14| TRDY# D20}AD12 H34 | MD6 R34 | MD44 Z34 | CS2# 
B16 | LOCK# D22 | AD7 H36 | MD38 R36 | MD12 Z36 | DQM5 
B18] C/BE1# D24/INTR J1|TDO $1 | CLKMODEO AA1 | VCC3 
B20 | AD13 D26 | TEST1 J3| VSS $3 | VID_VAL AAS | PIXEL11 
B22 | AD9 D28 | TEST3 J5| TEST S5 | PIXELO AA5|VSS 
B24 | AD6 D30 | MDO J33 | VCC2 $33 | MD14 AA33 | VSS 
B26 | AD3 D32 | MD32 J35 | VSS $35 | MD13 AA35 | CSO# 
B28 | SMl# D34|MD3 J37 | MD7 $37 | MD45 AA37 | VCC3 
B30 | AD1 D36 | MD35 K2| REQ1# T2 | PIXEL1 AB2 | PIXEL12 
B32 | TEST2 E1 | REQO# K4| GNT1# T4| PIXEL2 AB4 | PIXEL13 
B34 | MD33 E3 | REQ2# K34 | MD39 T34}MD15 AB34 | RASB# 
B36 | MD2 E5|AD28 K36 | MD8 T36 | MD46 AB36 | RASA# 
C1|VCC3 E7|VSS L1|VCC2 U1;VSS AC1 | VCC2 
C3} AD31 E9| VCC2 L3 | VCC2 U3 | VCC3 AC3 | VCC2 
C5|AD26 E11} VCC2 L5|VCC2 U5|VSS AC5 | VCC2 
C7} AD23 E13|VSS L33 | VCC2 U33| VSS AC33 | VCC2 
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Table 2-4 320 SPGA Pin Assignments - Sorted by Pin Number (cont.) 


Pin Pin Pin Pin Pin 

No. | Signal Name No. | Signal Name No. | Signal Name No. | Signal Name No. | Signal Name 
AC35 | VCC2 AJ3 | FTL# AK22 | MD21 AM4 | VID_DATA3 AN23 | CKEB 
AC37 | VCC2 AJ5 | VID_DATA5 AK24 | MD20 AM6 | ENA_DISP AN25 | VCC3 
AD2 | CRT_HSYNC AJ7|VSS AK26 | MD50 AM8 | SDCLK3 AN27 | MD17 
AD4 | DCLK AJ9 | VCC2 AK28 | MD16 AM10|}MD63 AN29 | VCC2 
AD34 | MA2 AJ11 | MD31 AK30 | DQM3 AM12|MD30 AN31| VSS 
AD36 | MAO AJ13| VSS AK32 | CS3# AM14|MD61 AN33 | CS1# 
AE1 | PIXEL14 AJ15 | MD60 AK34| VSS AM16|MD59 AN35 | VCC3 
AE3| VSS AJ17 | MD57 AK36 | BAO AM18}MD25 AN37|VSS 
AE5 | VCC2 AJ19| VSS AL1 | VCC2 AM20 | MD24 

AE33 | VCC2 AJ21 | MD22 AL3 | VID_DATA4 AM22 | MD53 

AE35| VSS AJ23 | MD52 AL5 | VID_DATA2 AM24 | MD51 

AE37 | MA1 AJ25| VSS AL7|SDCLK1 AM26|MD18 

AF2|PIXEL15 AJ27 | VCC2 AL9| VCC2 AM28 | MD48 

AF4 | PIXEL16 AJ29 | VCC2 AL11 | RW_CLK AM30 | DQM7 

AF34 | MA4 AJ31 | VSS AL13 | SDCLK_OUT AM832 | DQM2 

AF36 | MA3 AJ33 | BA1 AL15|VSS AM34 | MA12 

AG1/VSS AJ35 | MA9 AL17 | MD58 AM36 | VOLDET 

AG3 | PIXEL17 AJ37 | MA7 AL19| VCC3 AN1|VSS 

AG5|VSS AK2 | VID_RDY AL21 | MD23 AN3 | VCC2 
AG33 | VSS AK4|VSS AL23|VSS AN5 | VID_DATA1 
AG35 | MA5 AK6 | VID_DATAO AL25 | MD19 AN7|VSS 
AG37 | VSS AK8 | SDCLKO AL27 | MD49 ANY | VCC2 

AH2 | CRT_VSYNC AK10 | SDCLK2 AL29 | VCC2 AN11 | MD62 

AH4 | VID_DATA6 AK12 | SDCLK_IN AL31 | DQM6 AN13 | VCC3 
AH32 | MA10 AK14 | MD29 AL33 | CKEA AN15 | MD28 
AH34 | MA8 AK16 | MD27 AL35 | MA11 AN17|MD26 
AH36 | MA6 AK18 | MD56 AL37 | VCC3 AN19| VSS 

AJ1 | PCLK AK20 | MD55 AM2 | VID_DATA7 AN21 | MD54 
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Table 2-5 320 SPGA Pin Assignments - Sorted Alphabetically by Signal Name 


Signal Name | Type | Pin. No. Signal Name | Type | Pin. No. Signal Name | Type | Pin. No. Signal Name | Type | Pin. No. 
ADO VO C27 CRT_HSYNC oO AD2 MD4 VO E33 MD49 VO AL27 
AD1 VO B30 CRT_VSYNC oO AH2 MD5 VO F34 MD50 VO AK26 
AD2 VO A27 CS0# oO AA35 MD6 VO H34 MD51 VO AM24 
AD3 VO B26 CS1# oO AN33 MD7 VO J37 MD52 VO AJ23 
AD4 VO C25 CS2# oO 234 MD8 VO K36 MD53 VO AM22 
AD5 VO E23 CS3# oO AK32 MD9 VO M36 MD54 VO AN21 
AD6 VO B24 DCLK | AD4 MD10 VO P34 MD55 VO AK20 
AD7 VO D22 DEVSEL# s/t/s | E15 (PU) MD11 VO Q33 MD56 VO AK18 
AD8 VO A23 DQMO [e) X34 MD12 VO R36 MD57 VO AJ17 
AD9 VO B22 DQM1 fe) Y33 MD13 ie) $35 MD58 VO AL17 
AD10 VO C21 DQM2 [e) AM32 MD14 VO $33 MD59 VO AM16 
AD11 VO A21 DQM3 oO AK30 MD15 VO T34 MD60 fe) AJ15 
AD12 VO D20 DQM4 [e) Y37 MD16 VO AK28 MD61 VO AM14 
AD13 VO B20 DQM5 [e) Z36 MD17 VO AN27 MD62 VO AN11 
AD14 VO D18 DQM6 (eo) AL31 MD18 ie) AM26 MD63 VO AM10 
AD15 VO E17 DQM7 (e) AM30 MD19 VO AL25 NC Q5 
AD16 VO Alt ENA_DISP oO AM6 MD20 VO AK24 NC X2 
AD17 VO D12 FLT# | AJ3 MD21 ie) AK22 NC Z2 
AD18 VO C11 FP_HSYNC oO R4 MD22 VO AJ21 PAR VO C17 
AD19 VO B10 FP_VSYNC (e) P2 MD23 VO AL21 PCLK oO AJ1 
AD20 ie) D10 FRAME# s/t/s | C13 (PU) MD24 ie) AM20 PERR# s/t/s | D16 (PU) 
AD21 VO B8 GNTO# oO F2 MD25 ie) AM18 PIXELO oO $5 
AD22 VO D8 GNT1# oO K4 MD26 VO AN17 PIXEL1 oO T2 
AD23 VO C7 GNT2# [e) H2 MD27 VO AK16 PIXEL2 {e) T4 
AD24 VO D6 INTR | D24 MD28 VO AN15 PIXEL3 oO v2 
AD25 VO AS IRDY# s/t/s | D14 (PU) MD29 ie) AK14 PIXEL4 oO W5 
AD26 VO C5 IRQ13 [e) C31 MD30 ie) AM12 PIXEL5 oO W3 
AD27 VO B4 LOCK# s/t/s | B16 (PU) MD31 VO AJ11 PIXEL6 (e) W1 
AD28 VO E5 MAO (e) AD36 MD32 VO D32 PIXEL7 [e) Y5 
AD29 VO D4 MA1 (eo) AE37 MD33 ie) B34 PIXEL8 (e) Y1 
AD30 VO D2 MA2 oO AD34 MD34 VO C35 PIXEL9 oO x4 
AD31 VO C3 MA3 oO AF36 MD35 VO D36 PIXEL10 (e) Z4 
BAO (eo) AK36 MA4 oO AF34 MD36 ie) E35 PIXEL11 oO AA3 
BA1 fe) AJ33 MAS oO AG35 MD37 ie) G35 PIXEL12 [e) AB2 
CASA# [e) W37 MA6 oO AH36 MD38 VO H36 PIXEL13 oO AB4 
CASB# oO X36 MA7 [e) AJ37 MD39 VO K34 PIXEL14 oO AE1 
C/BEO# VO E21 MA8 oO AH34 MD40 VO M34 PIXEL15 oO AF2 
C/BE1# VO B18 MAQ oO AJ35 MD41 VO N35 PIXEL16 oO AF4 
C/BE2# VO B12 MA10 oO AH32 MD42 VO P36 PIXEL17 oO AG3 
C/BE3# fe) B6 MA11 oO AL35 MD43 VO Q37 RASA# oO AB36 
CKEA oO AL33 MA12 oO AM34 MD44 VO R34 RASB# oO AB34 
CKEB oO AN23 MDO VO D30 MD45 ie) S37 REQO# I E1 (PU) 
CLKMODEO | S1 MD1 VO C33 MD46 ie) T36 REQ1# I K2 (PU) 
CLKMODE1 | R2 MD2 VO B36 MD47 VO V36 REQ2# I E3 (PU) 
CLKMODE2 | G3 MD3 VO D34 MD48 VO AM28 RESET I M2 
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Table 2-5 320 SPGA Pin Assignments - Sorted Alphabetically by Signal Name (cont.) 


Signal Name | Type | Pin. No. Signal Name | Type | Pin. No. Signal Name | Type | Pin. No. Signal Name | Type | Pin. No. 

RW_CLK fe) AL11 VvCC2 PWR | AC33 vss GND A7 vss GND | AL23 

SDCLK_IN | AK12 VCC2 PWR | AC35 vss GND A19 vss GND AN1 

SDCLK_OUT | O AL13 VCC2 PWR | AC37 vss GND A31 vss GND AN7 

SDCLKO (0) AK8 VvCC2 PWR AE5 vss GND A37 vss GND | AN19 

SDCLK1 (0) AL7 VCC2 PWR | AE33 vss GND B2 vss GND | AN31 

SDCLK2 (0) AK10 VCC2 PWR AJ vss GND C15 vss GND | AN37 

SDCLK3 (0) AM8 VvCC2 PWR | AJ27 vss GND C23 WEA# (0) W33 

SERIALP fo) Qi VvCC2 PWR | AJ29 vss GND E7 WEB# (0) W35 

SERR# OD | A17 (PU) | | vCC2 PWR ALI vss GND E13 

SMl# ! B28 VCC2 PwR | ALg vss GND | £19 Note: PU/PD indicates pin is 

STOP# sits | A15 (PU) | | vcc2 pwr] at29 | |vss GND | E25 internalconnecied 10 
a 20-kohm pull-up/ 

SUSP# | M4 (PU) | | VCC2 PWR AN3 vss GND E31 down resistor 

SUSPA# (o) H4 VvCC2 PWR ANQ vss GND G1 

SYSCLK | V34 VvCC2 PWR | AN29 vss GND G5 

TCLK | P4 (PU) VCC3 PWR A3 vss GND G33 

TDI | F4 (PU) VCC3 PWR A13 vss GND G37 

TDN fe) E37 VCC3 PWR A25 vss GND J3 

TDO oO J1 VCC3 PWR A35 VSS GND J35 

TDP (0) F36 VCC3 PWR C1 vss GND N5 

TEST | J5 (PD) VCC3 PWR C19 vss GND N33 

TESTO oO A33 VCC3 PWR C37 VSS GND Q3 

TEST1 (@) D26 VCC3 PWR N1 vss GND Q35 

TEST2 (0) B32 VCC3 PWR N37 vss GND Ut 

TEST3 oO D28 VCC3 PWR U3 VSS GND U5 

TMS | N3 (PU) VCC3 PWR U35 VSS GND U33 

TRDY# sits | B14 (PU) | | VCC3 PWR AAI vss GND U37 

VvCC2 PWR AQ VCC3 PWR | AA37 vss GND Y3 

VCC2 PWR A29 VCC3 PWR | AL19 vss GND Y35 

VvCC2 PWR cg VCC3 PWR | AL37 vss GND AA5 

VvCC2 PWR c29 VCC3 PWR | AN13 vss GND | AA33 

VvCC2 PWR E9 VCC3 PWR | AN25 vss GND AE3 

VCC2 PWR E11 VCC3 PWR | AN35 vss GND | AE35 

VvCC2 PWR E27 VID_CLK fe) V4 vss GND AGI 

VvCC2 PWR E29 VID_DATAO (0) AK6 vss GND AG5 

VCC2 PWR J33 VID_DATA1 (0) AN5 vss GND | AG33 

VvCC2 PWR u VID_DATA2 fe) AL5 vss GND | AG37 

VCC2 PWR L3 VID_DATA3 fe) AM4 vss GND AJ7 

VvCC2 PWR L5 VID_DATA4 (0) AL3 vss GND | AuJ13 

VCC2 PWR L33 VID_DATA5 (0) AJ5 vss GND | AJ19 

VCC2 PWR L35 VID_DATA6 (0) AH4 vss GND | AJ25 

vcc2 PWR L37 VID_DATA7 fe) AM2 vss GND | AJ31 

VvCC2 PWR AC1 VID_RDY | AK2 vss GND AK4 

VCC2 PWR AC3 VID_VAL (0) $3 vss GND | AK34 

VCC2 PWR AC5 VOLDET (0) AM36 vss GND | AL15 
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2.2 Signal Descriptions 


2.2.1 System Interface Signals 


Signal Name 


BGA 


Pin No. 


SPGA 
Pin No. 


Type 


Description 


SYSCLK 


P26 


V34 


System Clock 


System Clock runs synchronously with the PCI bus. The inter- 

nal clock of the MediaGX processor is generated by an internal 
PLL which multiplies the SYSCLK input and can run up to eight 
times faster. The SYSCLK to core clock multiplier is configured 
using the CLKMOD[2:0] inputs. 


The SYSCLK input is a fixed frequency which can only be 
stopped or varied when the MediaGX processor is in a full 3V 
Suspend. (Section 6.4 “3-Volt Suspend Mode” on page 203 for 
details regarding this mode.) 


CLKMODE[2:0] 


M1, L1, 
M3 


G3, R2, 
$1 


Clock Mode 


These signals are used to set the core clock multiplier. The PCI 
clock "SYSCLK" is multiplied by the value programmed by 
CLKMODE/[2:0] to generate the MediaGX processor’s core 
clock. CLKMODE2 is valid only for MediaGX MMX-Enhanced 
processor revision 4.0 and up. The value read from DIR1 
(Device ID Register 1, refer to page 56) affects the definition of 
the CLKMODE pins. 


If DIR1 = 30h-33h then CLKMODE[1:0]: 

00 = SYSCLK multiplied by 4 (Test mode only) 
01 = SYSCLK multiplied by 6 

10 = SYSCLK multiplied by 7 

11 = SYSCLK multiplied by 5 


If DIR1 = 34h-4Fh then CLKMODE[1:0]: 

00 = SYSCLK multiplied by 4 (Test mode only) 
01 = SYSCLK multiplied by 6 

10 = SYSCLK multiplied by 7 

11 = SYSCLK multiplied by 8 


If DIR1 > or = 50h then CLKMODE[2:0]: 

000 = SYSCLK multiplied by 4 (Test mode only) 
001 = SYSCLK multiplied by 10 

010 = SYSCLK multiplied by 9 

011 = SYSCLK multiplied by 5 

100 = SYSCLK multiplied by 4 

101 = SYSCLK multiplied by 6 

110 = SYSCLK multiplied by 7 

111 = SYSCLK multiplied by 8 
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2.2.1 System Interface Signals (cont.) 


BGA SPGA 
Signal Name Pin No. | Pin No. Type _ | Description 
RESET J3 M2 | Reset 


RESET aborts all operations in progress and places the 
MediaGX processor into a reset state. RESET forces the CPU 
and peripheral functions to begin executing at a known state. 
All data in the on-chip cache is invalidated. 


RESET is an asynchronous input but must meet specified 
setup and hold times to guarantee recognition at a particular 
clock edge. This input is typically generated during the Power- 
On-Reset sequence. 


Note: Warm Reset does not require an input on the MediaGX 
processor since the function is virtualized using SMM. 


INTR Bi8 D24 (Maskable) Interrupt Request 

INTR is a level-sensitive input that causes the MediaGX pro- 
cessor to Suspend execution of the current instruction stream 
and begin execution of an interrupt service routine. The INTR 
input can be masked through the Flags Register IF bit. (See 
Table 3-4 "EFLAGS Register" on page 45 for bit definitions.) 


IRQ13 C22 C31 O Interrupt Request Level 13 
IRQ13 is asserted if an on-chip floating point error occurs. 


When a floating point error occurs, the MediaGX processor 
asserts the IRQ13 pin. The floating point interrupt handler then 
performs an OUT instruction to I/O address FOh or Fth. The 
MediaGX processor accepts either of these cycles and clears 
the IRQ13 pin. 


Refer to Section 3.4.1 “I/O Address Space” on page 65 for fur- 
ther information on IN/OUT instructions. 


SMI# C19 B28 System Management Interrupt 


SMIf# is a level-sensitive interrupt. SMI# puts the MediaGX pro- 
cessor into System Management Mode (SMM). 
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2.2.1 System Interface Signals (cont.) 


BGA SPGA 
Signal Name Pin No. | Pin No. Type | Description 


SUSP# H2 M4 | Suspend Request 


(PU) (PU) This signal is used to request that the MediaGX processor 


enter Suspend mode. After recognition of an active SUSP# 
input, the processor completes execution of the current instruc- 
tion, any pending decoded instructions and associated bus 
cycles. SUSP# is ignored following RESET# and is enabled by 
setting the SUSP bit in CCR2. (See Table 3-11 "Configuration 
Registers" on page 52 for CCR2 bit definitions.) 


Since the MediaGX processor includes system logic functions 
as well as the CPU core, there are special modes designed to 
support the different power management states associated with 
APM, ACPI, and portable designs. The part can be configured 
to stop only the CPU core clocks, or all clocks. When all clocks 
are stopped, the external clock can also be stopped. (See Sec- 
tion 6 “Power Management” on page 201 for more details 
regarding power management states.) 


This pin is internally connected to a 20-kohm pull-up resistor. 
SUSP# is pulled up when not active. 


SUSPA# E2 H4 Oo Suspend Acknowledge 


Suspend Acknowledge indicates that the MediaGX processor 
has entered low-power Suspend mode as a result of SUSP# 
assertion or execution of a HALT instruction. SUSPA# floats fol- 
lowing RESET# and is enabled by setting the SUSP bit in 
CCR2. (See Table 3-11 "Configuration Registers" on page 52 
for CCR2 bit definitions.) 


The SYSCLK input may be stopped after SUSPA# has been 
asserted to further reduce power consumption if the system is 
configured for 3V Suspend mode. (Section 6.4 “3-Volt Suspend 
Mode” on page 203 for details regarding this mode.) 


SERIALP L3 Ql O Serial Packet 


Serial Packet is the single wire serial-transmission signal to the 
Cx5520 chip. The clock used for this interface is the PCI clock 
(SYSCLK). This interface carries packets of miscellaneous 
information to the chipset to be used by the VSA software han- 
dlers. 
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2.2.2 PCI Interface Signals 


BGA SPGA 
Signal Name Pin No. | Pin No Type _ | Description 
AD[31:0] Refer Refer /O Multiplexed Address and Data 
to Table | to Table Addresses and data are multiplexed on the same PCI pins. A 
2-3 2-5 bus transaction consists of an address phase in the cycle in 
which FRAME is asserted followed by one or more data 
phases. During the address phase, AD[31:0] contain a physical 
32-bit address. For I/O, this is a byte address, for configuration 
and memory it is a DWORD address. During data phases, 
AD[7:0] contain the least significant byte (LSB) and AD[31:24] 
contain the most significant byte (MSB). Write data is stable 
and valid when IRDY# is asserted and read data is stable and 
valid when TRDY# is asserted. Data is transferred during those 
SYSCLKS where both IRDY# and TRDY# are asserted. 
C/BE[3:0]# D5, B6, B12, /0 Multiplexed Command and Byte Enables 
B8, B18, Bus command and byte enables are multiplexed on the same 
C13, E21 PCI pins. During the address phase of a transaction when 
A1S FRAME+# is active, C/BE[3:0]# define the bus command. During 
the data phase C/BE[3:0]# are used as byte enables. The byte 
enables are valid for the entire data phase and determine which 
byte lanes carry meaningful data. C/BEO# applies to byte 0 
(LSB) and C/BE3# applies to byte 3 (MSB). 
The command encoding and types are listed below. 
0000 = Interrupt Acknowledge 
0001 = Special Cycle 
0010 = I/O Read 
0011 = I/O Write 
0100 = Reserved 
0101 = Reserved 
0110 = Memory Read 
0111 = Memory Write 
1000 = Reserved 
1001 = Reserved 
1010 = Configuration Read 
1011 = Configuration Write 
1100 = Memory Read Multiple 
1101 = Dual Address Cycle (Reserved) 
1110 = Memory Read Line 
1111 = Memory Write and Invalidate 
ei} 
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2.2.2 PCI Interface Signals 


(cont.) 


Signal Name 


BGA 
Pin No. 


SPGA 
Pin No 


Type 


Description 


PAR 


B12 


C17 


/O 


Parity 

Parity generation is required by all PCI agents: the master 
drives PAR for address and write-data phases, the target drives 
PAR for read-data phases. Parity is even across AD[31:0] and 
C/BE[3:0]#. 


For address phases, PAR is stable and valid one SYSCLK after 
the address phase. It has the same timing as AD[81:0] but 
delayed by one SYSCLK. 


For data phases, PAR is stable and valid one SYSCLK after 
either IRDY# is asserted on a write transaction or after TRDY# 
is asserted on a read transaction. Once PAR is valid, it remains 
valid until one SYSCLK after the completion of the data phase. 
(Also see PERR#.) 


FRAME# 


A8 
(PU) 


C13 
(PU) 


sit/s 


Frame 


Cycle Frame is driven by the current master to indicate the 
beginning and duration of an access. FRAME is asserted to 
indicate a bus transaction is beginning. While FRAME# is 
asserted, data transfers continue. When FRAME is deas- 
serted, the transaction is in the final data phase. 


This pin is internally connected to a 20-kohm pull-up resistor. 


IRDY# 


cg 
(PU) 


D14 
(PU) 


sit/s 


Initiator Ready 


Initiator Ready is asserted to indicate that the bus master is 
able to complete the current data phase of the transaction. 
IRDY# is used in conjunction with TRDY#. A data phase is 
completed on any SYSCLK in which both IRDY# and TRDY# 
are sampled asserted. During a write, IRDY# indicates valid 
data is present on AD[31:0]. During a read, it indicates the mas- 
ter is prepared to accept data. Wait cycles are inserted until 
both IRDY# and TRDY# are asserted together. 


This pin is internally connected to a 20-kohm pull-up resistor. 


TRDY# 


B14 
(PU) 


sit/s 


Target Ready 


TRDY# is asserted to indicate that the target agent is able to 
complete the current data phase of the transaction. TRDY# is 
used in conjunction with IRDY#. A data phase is complete on 
any SYSCLK in which both TRDY# and IRDY# are sampled 
asserted. During a read, TRDY# indicates that valid data is 
present on AD[31:0]. During a write, it indicates the target is 
prepared to accept data. Wait cycles are inserted until both 
IRDY# and TRDY# are asserted together. 


This pin is internally connected to a 20-kohm pull-up resistor. 
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(cont.) 


Signal Name 


BGA 
Pin No. 


SPGA 
Pin No 


Type 


Description 


STOP# 


C11 
(PU) 


A15 
(PU) 


si/t/s 


Target Stop 


STOP# is asserted to indicate that the current target is request- 
ing the master to stop the current transaction. This signal is 
used with DEVSEL+# to indicate retry, disconnect or target 
abort. If STOP# is sampled active while a master, FRAME# will 
be deasserted and the cycle stopped within three SYSCLK 
cycles. As an input, STOP# can be asserted in the following 
cases. 1) If a PCI master tries to access memory that has been 
locked by another master. This condition is detected if FRAME# 
and LOCK# are asserted during an address phase. 2) STOP# 
will also be asserted if the PCI write buffers are full or if a previ- 
ously buffered cycle has not completed. 3) Finally, STOP# can 
be asserted on read cycles that cross cache line boundaries. 
This is conditional based upon the programming of bit 1 in PCI 
Control Function 2 Register. (See Table 4-38 "PCI Configura- 
tion Registers" on page 179 for programming details.) 


This pin is internally connected to a 20-kohm pull-up resistor. 


LOCK# 


B11 
(PU) 


B16 
(PU) 


sit/s 


Lock Operation 


LOCK# indicates an atomic operation that may require multiple 
transactions to complete. When LOCK# is asserted, nonexclu- 
sive transactions may proceed to an address that is not cur- 
rently locked (at least 16 bytes must be locked). A grant to start 
a transaction on PCI does not guarantee control of LOCK#. 
Control of LOCK# is obtained under it own protocol in conjunc- 
tion with GNT#. It is possible for different agents to use PCI 
while a single master retains ownership of LOCK#. The arbiter 
can implement a complete system lock. In this mode, if LOCK# 
is active, no other master can gain access to the system until 
the LOCK# is deasserted. 


This pin is internally connected to a 20-kohm pull-up resistor. 


DEVSEL# 


AQ 
(PU) 


E15 
(PU) 


s/t/s 


Device Select 


DEVSEL# indicates that the driving device has decoded its 
address as the target of the current access. As an input, 
DEVSEL+# indicates whether any device on the bus has been 
selected. DEVSEL# will also be driven by any agent that has 
the ability to accept cycles on a subtractive decode basis. As a 
master, if no DEVSEL+# is detected within and up to the subtrac- 
tive decode clock, a master abort cycle will result expect for 
special cycles which do not expect a DEVSEL# returned. 


This pin is internally connected to a 20-kohm pull-up resistor. 
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2.2.2 PCI Interface Signals (cont.) 


BGA SPGA 
Signal Name Pin No. | Pin No Type _ | Description 


PERR# Al1 D16 s/t/s Parity Error 


(PU) (PU) PERR is used for reporting of data parity errors during all PCI 


transactions except a Special Cycle. The PERR# line is driven 
two SYSCLKs after the data in which the error was detected. 
This is one SYSCLK after the PAR that is attached to the data. 
The minimum duration of PERR# is one SYSCLK for each data 
phase in which a data parity error is detected. PERR# must be 
driven high for one SYSCLK before being tristated. A target 
asserts PERR# on write cycles if it has claimed the cycle with 
DEVSEL#. The master asserts PERR# on read cycles. 


This pin is internally connected to a 20-kohm pull-up resistor. 


SERR# C12 A17 OD System Error 


(PU) (PU) System Error may be asserted by any agent for reporting errors 


other than PCI parity. The intent is to have the PCI central 
agent assert NMI to the processor. When the Parity Enable bit 
is set in the Memory Controller Configuration register, SERR# 
will be asserted upon detecting a parity error on read opera- 
tions from DRAM. 


REQ[2:0]# D3, ES, Request Lines 


Request indicates to the arbiter that an agent desires use of the 
bus. Each master has its own REQ# line. REQ# priorities are 
based on the arbitration scheme chosen. 


Each of these pins are internally connected to a 20-kohm pull- 
up resistor. 


GNT[2:0]# El, H2, O Grant Lines 


F2, K4, 
D1 F2 


Grant indicates to the requesting master that it has been 
granted access to the bus. Each master has its own GNT# line. 
GNT# can be pulled away at any time a higher REQ# is 
received or if the master does not begin a cycle within a mini- 
mum period of time (16 SYSCLKs). 
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2.2.3 Memory Controller Interface Signals 


BGA SPGA 
Signal Name Pin No. | Pin No. Type _ | Description 
Note: The memory controller interface supports two types of memory configurations: SDRAM modules on the sys- 
tem board and JEDEC DIMM connectors. Refer to Section 4.3 “Memory Controller” on page 116 for detailed 
information regarding signal connections. 
MDJ[63:0] Refer Refer /O Memory Data Bus 
to La to we The data bus lines driven to/from system memory. 
MA[12:0] Refer Refer oO Memory Address Bus 
to Table | to Table The multiplexed row/column address lines driven to the system 
23 25 memory. 
Supports 256Mbit SDRAM. 
BA[1:0] AD26, AJ33, oO Bank Address Bits 
AD25 AK36 These bits are used to select the component bank within the 
SDRAM. 
CS[3:0]# AE23, AK82, oO Chip Selects 
V25, 234, The chip selects are used to select the module bank within the 
AD23, ANS3, system memory. Each chip select corresponds to a specific 
V26 AASS module bank. 
If CS# is high, the bank(s) do not respond to RAS#, CAS#, 
WE# until the bank is selected again. 
RASA#, W24, AB36, oO Row Address Strobe 
RASB# Wes AB34 RAS#, CAS#, WE# and CKE are encoded to support the differ- 
ent SDRAM commands. RASA+# is used with CS[1:0]#. RASB# 
is used with CS[3:2]#. 
CASA#, P25, W37, oO Column Address Strobe 
CASB# R26 X36 RAS#, CAS#, WE# and CKE are encoded to support the differ- 
ent SDRAM commands. CASA+# is used with CS[1:0]#. CASB# 
is used with CS[3:2]#. 
WEA#, R25, W33, oO Write Enable 
WEB# R24 W35 RAS#, CAS#, WE# and CKE are encoded to support the differ- 
ent SDRAM commands. WEA+# is used with CS[1:0]#. WEB# is 
used with CS[3:2]#. 
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2.2.3 Memory Controller Interface Signals (cont.) 


BGA SPGA 
Signal Name Pin No. | Pin No. Type _ | Description 
DQM[7:0] Refer Refer oO Data Mask Control Bits 
to Table | to Table During memory read cycles, these outputs control whether the 
2-3 2-5 SDRAM output buffers are driven on the MD bus or not. All 
DQM signals are asserted during read cycles. 
During memory write cycles, these outputs control whether or 
not MD data will be written into the SDRAM. 
DQM[7:0] connect directly to the DQM7-0 pins of each connec- 
tor. 
CKEA, AF24, AL33, oO Clock Enable 
CKEB AD16 AN23 These signals are used to enter Suspend/power-down mode. 
When CKE goes low when no read or write cycle is in progress, 
the SDRAM enters power-down mode. To ensure that SDRAM 
data remains valid, the self-refresh command is executed. To 
exit this mode, drive CKE high. 
For normal operation, CKE should be held high. 
SDCLK[3:0] AE4, AM8, oO SDRAM Clocks 
AFS, AK10, The SDRAM samples all the control, address, and data using 
AES, AL?, these clocks. SDCLK[3:0] should be used with CS[3:0}#, 
AF4 AK8 respectively, for the Suspend mode to function correctly. 
SDCLK_IN AE8 AK12 SDRAM Clock Input 
The MediaGX processor samples the memory read data on this 
clock. Works in conjunction with the SDCLK_OUT signal. 
SDCLK_OUT AF8 AL13 oO SDRAM Clock Output 
This output is routed back to SDCLK_IN. The board designer 
should vary the length of the board trace to control skew 
between SDCLK_IN and SDCLK. 
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2.2.4 Video Interface Signals 


BGA SPGA 

Signal Name Pin No | Pin No Type _ | Description 

PCLK AC1 AJ1 O Pixel Port Clock 
Pixel Port Clock represents the pixel dotclock or a 2x multiple of 
the dotclock for some 16-bit-per-pixel modes. It determines the 
data transfer rate from the MediaGX processor to the 
Cx5520/Cx5530. 

VID_CLK P1 V4 O Video Clock 
Video Clock represents the video port clock to the 
Cx5520/Cx5530. This pin is only used if the Video Port is 
enabled. 

DCLK AB1 AD4 Dotclock 
The DCLK input is driven from the Cx5520/Cx5530 and repre- 
sents the pixel dot clock. In some cases, such as when display- 
ing 16 BPP data with an eight-bit-graphics pixel port, this clock 
will actually be a 2x multiple of the dotclock. 

CRT_HSYNC Ww2 AD2 O CRT Horizontal Sync 


CRT Horizontal Sync establishes the line rate and horizontal 
retrace interval for an attached CRT. The polarity is program- 
mable and depends on the display mode. 


CRT_VSYNC AA3 AH2 O CRT Vertical Syne 


CRT Vertical Sync establishes the screen refresh rate and verti- 
cal retrace interval for an attached CRT. The polarity is pro- 
grammable and depends on the display mode. 


FP_HSYNC L2 R4 O Flat Panel Horizontal Sync 


Flat Panel Horizontal Sync establishes the line rate and hori- 
zontal retrace interval for a TFT display. Polarity is programma- 
ble and depends on the display mode. 


This signal is an input to the Cx5520/Cx5530. The 
Cx5520/Cx5530 re-drives this signal to the flat panel. 


If no flat panel is used in the system, this signal does not need 
to be connected. 


FP_VSYNC J1 P2 O Flat Panel Vertical Sync 


Flat Panel Vertical Sync establishes the screen refresh rate and 
vertical retrace interval for a TFT display. Polarity is program- 
mable and depends on the display mode. 


This signal is an input to the Cx5520/Cx5530. The 
Cx5520/Cx5530 re-drives this signal to the flat panel. 


If no flat panel is used in the system, this signal does not need 
to be connected. 
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2.2.4 Video Interface Signals (cont.) 


BGA SPGA 
Signal Name Pin No | Pin No Type _ | Description 
ENA_DISP AD5 AM6 O Display Enable 


Display Enable indicates the active display portion of a scan 
line to the Cx5520/Cx5530. 


In a Cx5520/Cx5530-based system, this signal is required to be 
connected even if there is no TFT panel in the system. 


VID_RDY AD1 AK2 Video Ready 


This input signal indicates that the video FIFO in the 
Cx5520/Cx5530 is ready to receive more data. 


VID_VAL M2 $3 oO Video Valid 
VID_VAL qualifies valid video data to the Cx5520/Cx5530. 
VID_DATA|7:0] Refer Refer oO Video Data Bus 
to Table | to Table When the Video Port is enabled, this bus drives Video (Y-U-V) 
2-3 2-5 data synchronous to the VID_CLK output. 
PIXEL[17:0] Refer Refer oO Graphics Pixel Data Bus 
to Table | to Table 


This bus drives graphics pixel data synchronous to the PCLK 


2-3 2-5 output. 
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2.2.5 Power, Ground, and No Connect Signals 


BGA SPGA 
Signal Name Pin No. | Pin No. Type_ | Description 
VOLDET AC5 AM36 O Voltage Detect 
In early schematic revisions this pin was identified as VOLDET. 
However, in the production version this pin is a "no connect" 
and should be left disconnected. 
VSS Refer Refer GND _ | Ground Connection 
to Table | to Table 
2-3 2-5 
(Total of | (Total of 
71) 50) 
VCC2 Refer Refer PWR _| 2.9V (nominal) Core Power Connection 
to Table | to Table 
2-3 2-5 
(Total of | (Total of 
32) 32) 
VCC3 Refer Refer PWR_| 3.3V (nominal) I/O Power Connection 
to Table | to Table 
2-3 2-5 
(Total of | (Total of 
32) 18) 
NC -- Q5, X2, No Connection 
Ze A line designated as NC should be left disconnected. 
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2.2.6 Cyrix Internal Test and Measurement Signals 


BGA SPGA 
Signal Name Pin No. | Pin No. Type _ | Description 
FLT# AC2 AJ3 | Float 


Float Outputs forces the MediaGX processor to float all outputs 
in the high-impedance state and to enter a power-down state. 


RW_CLK AE6 AL11 O Raw Clock 


This output is the MediaGX processor clock. This debug signal 
can be used to verify clock operation. 


TEST[3:0] B22, D28, O SDRAM Test Outputs 
A23, B32, These outputs are used for internal debug only. 
B21, D26, 
C21 A33 
TCLK J2 P4 Test Clock 
(PU) (PU) JTAG test clock. 


This pin is internally connected to a 20-kohm pull-up resistor. 


TDI D2 F4 Test Data Input 


(PU) (PU) JTAG serial test-data input. 
This pin is internally connected to a 20-kohm pull-up resistor. 
TDO Ft J1 O Test Data Output 
JTAG serial test-data output. 
TMS H1 N3 Test Mode Select 
(PU) (PU) JTAG test-mode select. 
This pin is internally connected to a 20-kohm pull-up resistor. 
TEST F3 J5 Test 
(PD) (PD) 


Test-mode input. 
This pin is internally connected to a 20-kohm pull-down resistor. 


TDP E24 F36 O Thermal Diode Positive 


TDP is the positive terminal of the thermal diode on the die. The 
diode is used to do thermal characterization of the device ina 
system. This signal works in conjunction with TDN. 


TDN D26 E37 O Thermal Diode Negative 


TDN is the negative terminal of the thermal diode on the die. 
The diode is used to do thermal characterization of the device 
in a system. This signal works in conjunction with TDP. 
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2.3 Subsystem Signal Connections 


As previously stated, the MediaGX Integrated Chip. Figure 2-4 shows the signal connections 
Subsystem with MMX support consists of two between the processor and the I/O companion 
chips. The MediaGX MMX-Enhanced Processor chip. 

and either the Cx5520 or Cx5530 I/O Companion 


SYSCLK GX_CLK 
SERIALP PSERIAL 
IRQ13 IRQ13 
SMI# SMI# 
PCLK PCLK 
DCLK DCLK 
CRT_HSYNC HSYNC 
Exclusive CRT_VSYNC VSYNC 


Interconn 
igreones < PIXEL[17:0] PIXEL[23:0] 


Signals 
(Do not connect to FP_HSYNC Se Ne Not needed if 
any other device) FP_VSYNC FP_VSYNC J CRT only (no TFT) 
ENA _DISP FP_ENA_DISP 
VID_VAL VID_VAL 
VID_CLK VID_CLK 
VID_DATA[7:0] VID_DATA[7:0] 
VID_RDY VID_RDY 
RESET CPU_RST 
INTR INTR 


Cx5520/Cx5530 
1/0 Companion 


MediaGX™ 
MMX™-Enhanced eueee ieee 

Processor SUSPA# SUSPA# 

AD[31:0 AD[31:0] 


C/BE[3:0] C/BE[3:0}# 
PAR 


Nonexclusive 
Interconnect 

Signals < 

(May also connect 

to other circuitry) 


Note: Refer to Figure 2-5 for interconnection of these lines. 


FRAME# 
IRDY# 
TRDY# 
STOP# 
LOCK# 
DEVSEL# 
PERR# 
SERR# 
REQ# 
GNT# 


Figure 2-4 Subsystem Signal Connections 
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PIXEL17 PIXEL23 
PIXEL16 PIXEL22 
PIXEL15 PIXEL21 
PIXEL14 PIXEL20 
PIXEL13 PIXEL19 
PIXEL12 PIXEL18 
PIXEL17 
PIXEL16 
PIXEL11 PIXEL15 
PIXEL10 PIXEL14 
PIXEL9 PIXEL13 
PIXEL8 PIXEL12 
PIXEL7 PIXEL11 
PIXEL6 PIXEL10 
PIXEL9 
PIXEL8 
PIXEL5 PIXEL7 
PIXEL4 PIXEL6 
PIXEL3 PIXELS 
PIXEL2 PIXEL4 
PIXEL1 PIXEL3 
PIXELO PIXEL2 
PIXEL1 
PIXELO 


MediaGX™ 
MMX™-Enhanced 
Processor 


Cx5520/Cx5530 
1/0 Companion 


Figure 2-5 PIXEL Signal Connections 
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2.4 Power Planes 


Figure 2-6 shows layout recommendations for split- assumes there is one power plane, and no compo- 
ting the power plane between 2.9 (Veco) and 3.3 nents on the back of the board. 
(Vcec3) volts in the BGA package. The illustration 


3.3V Plane 
(VCC3) 


2.9V Plane 
(VCC2) 


3.3V Plane 


(VCC3) MediaGX™ 
MMX™-Enhanced 
Processor 


352 BGA - Top View 


2.9V Plane 


Legend 
3.3V Plane 
(VCC3) 


ONIDIO = High frequency capacitor 
OX JE4O = 220uF, low ESR capacitor 
@ =3.3V connection 
© =2.9V connection 


Figure 2-6 BGA Recommended Split Power Plane and Decoupling 
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Figure 2-7 shows layout recommendations for split- (Vcec3) volts in the SPGA package. 
ting the power plane between 2.9 (Veco) and 3.3 


3.3V Plane . 
(VCC3) 


‘eo! g 2.9V Plane 
(VCC2) 


MediaGX™ 
MMX™-Enhanced a ele 
3.3V Plane Processor 
(VCC3) 


320 SPGA - Top View 


2.9V Plane 


AN 


' To 2.9V 37 
: Regulator Neca 
Legend 
QOD = High frequency capacitor 
CALIENDO == 220uF, low ESR capacitor 
@ = 3.3V connection Note: Where signals cross plane splits, it is recommended to include 
© =2.9V connection AC decoupling between planes with 47pF capacitors. 


Figure 2-7 SPGA Recommended Split Power Plane and Decoupling 


GXm_db_v2.0 Cyrix Corporation Confidential Page 37 


Cyrix : 
Medial9x 


M M X"*- E N H AN € E D 


Page 38 Cyrix Corporation Confidential GXm_db_v2.0 


MediaGX™ MMX™-Enhanced Processor 


ialOXK 


M M X" = N H A N € E D 


3 Processor Programming 


This section describes the internal operations of 

the MediaGX MMX-Enhanced processor from a 

programmer’s point of view. It includes a descrip- 
tion of the traditional “core” processing and FPU 

operations. The integrated function registers are 

described at the end of this chapter. 


The primary register sets within the processor core 
include: 


* Application Register Set 

* System Register Set 

* Model Specific Register Set 

* Floating Point Unit Register Set. 


The initialization of the major registers within in 
core are shown in Table 3-1 on page 40. 


The integrated function sets are located in main 
memory space and include: 


« Internal Bus Interface Unit Register Set 
* Graphics Pipeline Register Set 

* Display Controller Register Set 

* Memory Controller Register Set 

* Power Management Register Set 


Integrated x86 Solution with MMX™ Support 


Ley 


3.1 Core Processor Initialization 


The MediaGxX processor is initialized when the 
RESET signal is asserted. The processor is placed 
in real mode and the registers listed in Table 3-1 
are set to their initialized values. RESET invali- 
dates and disables the CPU cache, and turns off 
paging. When RESET is asserted, the CPU termi- 
nates all local bus activity and all internal execu- 
tion. During the entire time that RESET is asserted, 
the internal pipeline is flushed and no instruction 
execution or bus activity occurs. 


Approximately 150 to 250 external clock cycles 
after RESET is deasserted, the processor begins 
executing instructions at the top of physical 
memory (address location FFFF FFFOh). The actual 
time depends on the clock scaling in use. Also, an 
additional 2°° clock cycles are needed when self- 
test is requested. 


Typically, an intersegment jump is placed at FFFF 
FFFOh. This instruction will force the processor to 
begin execution in the lowest 1MB of address 
space. 


The following table, Table 3-1, lists the core regis- 
ters and illustrates how they are initialized. 
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Table 3-1 Initialized Core Register Controls 
Register Register Name Initialized Contents Comments 
EAX Accumulator XXXX XXxXxh 0000 0000h indicates self-test passed. 
EBX Base XXXX XXxxh 
ECX Count XXXX XXxxh 
EDX Data xxxx 04 [DIRO] DIRO = Device ID 
EBP Base Pointer XXXX XXxxh 
ESI Source Index XXXX XXxXxh 
EDI Destination Index XXXX XXxxh 
ESP Stack Pointer XXXX XXxxh 
EFLAGS Flags 0000 0002h See Table 3-4 on page 45 for bit definitions. 
EIP Instruction Pointer 0000 FFFOh 
ES Extra Segment 0000h Base address set to 0000 0000h. 
Limit set to FFFFh. 
cS Code Segment FOOOh Base address set to FFFF 0000h. 
Limit set to FFFFh. 
ss Stack Segment 0000h Base address set to 0000 0000h. 
Limit set to FFFFh. 
DS Data Segment 0000h Base address set to 0000 0000h. 
Limit set to FFFFh. 
FS Extra Segment 0000h Base address set to 0000 0000h. 
Limit set to FFFFh. 
GS Extra Segment 0000h Base address set to 0000 0000h. 
Limit set to FFFFh. 
IDTR Interrupt Descriptor Table Base = 0, Limit = 3FFh 
Register 
GDTR Global Descriptor Table XXXX XXXXM_ XxXxXxh 
Register 
LDTR Local Descriptor Table Register | xxxx xxxxh, xxxxh 
TR Task Register Xxxxh 
CRO Machine Status Word 6000 0010h See Table 3-7 on page 48 for bit definitions. 
CR2 Control Register 2 XXXX XXXxN See Table 3-7 on page 48 for bit definitions. 
CR3 Control Register 3 XXXX XXXxh See Table 3-7 on page 48 for bit definitions. 
CR4 Control Register 4 0000 0000h See Table 3-7 on page 48 for bit definitions. 
CCR1 Configuration Control 1 00h See Table 3-11 on page 52 for bit definitions. 
CCR2 Configuration Control 2 00h See Table 3-11 on page 52 for bit definitions. 
CCR3 Configuration Control 3 00h See Table 3-11 on page 53 for bit definitions. 
CCR7 Configuration Control 7 00h See Table 3-11 on page 54 for bit definitions. 
SMARO SMM Address 0 00h See Table 3-11 on page 55 for bit definitions. 
SMAR1 SMM Address 1 00h See Table 3-11 on page 55 for bit definitions. 
SMAR2 SMM Address 2 / SMAR Size 00h See Table 3-11 on page 55 for bit definitions. 
DIRO Device Identification 0 4xh Device ID and reads back initial CPU clock- 
speed setting. 
See Table 3-11 on page 56 for bit definitions. 
DIR1 Device Identification 1 xxh Stepping and Revision ID (RO). 
See Table 3-11 on page 56 for bit definitions. 
DR7 Debug Register 7 0000 0400h See Table 3-13 on page 58 for bit definitions. 
Note: x = Undefined value 


Page 40 Cyrix Corporation Confidential GXm_db_v2.0 


Instruction Set Overview 3 


3.2 Instruction Set Overview 


The MediaGX processor instruction set can be 
divided into nine types of operations: 


¢ Arithmetic 

¢ Bit Manipulation 

* Shift/Rotate 

* String Manipulation 

¢ Control Transfer 

¢ Data Transfer 

¢ Floating Point 

¢ High-Level Language Support 
* Operating System Support 


MediaGxX processor instructions operate on as few 
as zero operands and as many as three operands. 
An NOP instruction (no operation) is an example of 
a zero-operand instruction. Two-operand instruc- 
tions allow the specification of an explicit source 
and destination pair as part of the instruction. 
These two-operand instructions can be divided into 
ten groups according to operand types: 


« Register to Register 

« Register to Memory 

¢« Memory to Register 

« Memory to Memory 

¢ Register to I/O 

« I/O to Register 

* Memory to I/O 

¢ I/O to Memory 

* Immediate Data to Register 
« Immediate Data to Memory 


An operand can be held in the instruction itself (as 
in the case of an immediate operand), in one of the 
processor’s registers or I/O ports, or in memory. An 
immediate operand is fetched as part of the 
opcode for the instruction. 


Operand lengths of 8, 16, 32 or 48 bits are 
supported as well as 64 or 80 bits associated with 
floating-point instructions. Operand lengths of 8 or 
32 bits are generally used when executing code 
written for 386- or 486-class (32-bit code) proces- 
sors. Operand lengths of 8 or 16 bits are generally 
used when executing existing 8086 or 80286 code 
(16-bit code). The default length of an operand can 
be overridden by placing one or more instruction 
prefixes in front of the opcode. For example, the 
use of prefixes allows a 32-bit operand to be used 
with 16-bit code or a 16-bit operand to be used with 
32-bit code. 


Section 9.1 “General Instruction Set Format” on 
page 234 contains the clock count table that lists 
each instruction in the CPU instruction set. 
Included in the table are the associated opcodes, 
execution clock counts, and effects on the Flags 
register. 


3.2.1 Lock Prefix 


The LOCK prefix may be placed before certain 
instructions that read, modify, then write back to 
memory. The PCI will not be granted access in the 
middle of locked instructions. The LOCK prefix can 
be used with the following instructions only when 
the result is a write operation to memory. 


Bit Test Instructions (BTS, BTR, BTC) 

Exchange Instructions (XADD, XCHG, 
CMPXCHG) 

One-Operand Arithmetic and Logical Instruc- 
tions (DEC, INC, NEG, NOT) 

Two-Operand Arithmetic and Logical Instruc- 
tions (ADC, ADD, AND, OR, SBB, SUB, 
XOR). 


An invalid opcode exception is generated if the 
LOCK prefix is used with any other instruction or 
with one of the instructions above when no write 
operation to memory occurs (for example, when 
the destination is a register). 
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Register Sets 


The accessible registers in the processor are 
grouped into three sets: 


a 


) 


ue) 


The Application Register Set contains the 
registers frequently used by application 
programmers. Table 3-2 shows the general 
purpose registers, segment registers, the 


instruction pointer register and the flag register. 


The System Register Set contains the regis- 
ters typically reserved for operating-systems 
programmers: control registers, system 
address registers, debug registers, configura- 
tion registers, and test registers. 


Table 3-2. Application Register Set 


> 4 Register Sets 


The Model Specific Register (MSR) Set is 
used to monitor the performance of the 
processor or a specific component within the 
processor. The model specific register set has 
one 64-bit register called the Time Stamp 
Counter. 


Each of these register sets are discussed in detail 
in the subsections that follow. Additional registers 
to support integrated MediaGX processor 
subsystems are described in Section 4.1 “Inte- 
grated Functions Programming Interface” of this 
manual. 


31 16 | 15 8/7 0 
AX 
AH AL 
EAX (Extended A Register) 
BX 
BH BL 
EBX (Extended B Register) 
CX 
CH CL 
ECX (Extended C Register) 
DX General 
DH DL Purpose 
EDX (Extended D Register) Bodiviots 
SI (Source Index) 
ESI (Extended Source Index) 
DI (Destination Index) 
EDI (Extended Destination Index) 
BP (Base Pointer) 
EBP (Extended Base Pointer) 
SP (Stack Pointer) 
ESP (Extended Stack Pointer) 
CS (Code Segment) 
SS (Stack Segment) 
DS (D Data Segment) Segment 
ES (E Data Segment) (pelocior) 
FS (F Data Segment) icaielels 
GS (G Data Segment) 
eC EIP (Extended instruction Pointer Regisier). S| Instruction Pointer and 
EFLAGS (Extended Flags Register) Flags Register 
a See en nee ns ern nner reeeecceeeeeeeeeeeeeeeee cece rans 5 
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3.3.1 Application Register Set 

The Application Register Set consists of the regis- 
ters most often used by the applications 
programmer. These registers are generally acces- 
sible, although some bits in the Flags register are 
protected. 


The General Purpose Register contents are 
frequently modified by instructions and typically 


contain arithmetic and logical instruction operands. 


In real mode, Segment Registers contain the 
base address for each segment. In protected 
mode, the segment registers contain segment 
selectors. The segment selectors provide indexing 
for tables (located in memory) that contain the 
base address for each segment, as well as other 
memory addressing information. 


The Instruction Pointer Register points to the 
next instruction that the processor will execute. 
This register is automatically incremented by the 
processor as execution progresses. 


The Flags Register contains control bits used to 
reflect the status of previously executed instruc- 

tions. This register also contains control bits that 
affect the operation of some instructions. 


3.3.1.1 General Purpose Registers 


The General Purpose Registers are divided into 
four data registers, two pointer registers, and two 
index registers as shown in Table 3-2 on page 42. 


The Data Registers are used by the applications 
programmer to manipulate data structures and to 
hold the results of logical and arithmetic opera- 
tions. Different portions of general data registers 
can be addressed by using different names. 


An “E” prefix identifies the complete 32-bit register. 
An “X” suffix without the “E” prefix identifies the 
lower 16 bits of the register. 


The lower two bytes of a data register are 
addressed with an “H” suffix (identifies the upper 
byte) or an “L” suffix (identifies the lower byte). 
These _L and_H portions of the data registers act 
as independent registers. For example, if the AH 
register is written to by an instruction, the AL 
register bits remain unchanged. 


The Pointer and Index Registers are listed below. 


Sl or ESI Source Index 
DI or EDI Destination Index 
SP or ESP Stack Pointer 
BP or EBP Base Pointer 


These registers can be addressed as 16- or 32-bit 
registers, with the “E” prefix indicating 32 bits. The 
pointer and index registers can be used as general 
purpose registers; however, some instructions use 
a fixed assignment of these registers. For example, 
repeated string operations always use ESI as the 
source pointer, EDI as the destination pointer, and 
ECX as a counter. The instructions that use fixed 
registers include multiply and divide, I/O access, 
string operations, stack operations, loop, variable 
shift and rotate, and translate instructions. 


The MediaGX processor implements a stack using 
the ESP register. This stack is accessed during the 
PUSH and POP instructions, procedure calls, 
procedure returns, interrupts, exceptions, and 
interrupt/exception returns. The MediaGX 
processor automatically adjusts the value of the 
ESP during operations that result from these 
instructions. 


The EBP register may be used to refer to data 
passed on the stack during procedure calls. Local 
data may also be placed on the stack and 
accessed with BP. This register provides a mecha- 
nism to access tack data in high-level languages. 
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3.3.1.2 Segment Registers 


The 16-bit segment registers, part of the main 
memory addressing mechanism, are described in 
Section 3.5 “Offset, Segment, and Paging Mecha- 
nisms” on page 66. The six segment registers are: 


CS - Code Segment 
DS - Data Segment 
SS - Stack Segment 
ES - Extra Segment 
FS - Additional Data Segment 
GS - Additional Data Segment 


The segment registers are used to select 
segments in main memory. A segment acts as 
private memory for different elements of a program 
such as code space, data space and stack space. 


There are two segment mechanisms, one for Real 
and Virtual 8086 Operating Modes and one for 
Protective Mode. Initialization and transition to 
protective mode is described in Section 3.13.4 
“Initialization and Transition to Protected Mode” on 
page 99. The segment mechanisms are described 
in Section 3.7 “Descriptors and Segment Mecha- 
nisms” on page 68. 


Table 3-3 Segment Register Selection Rules 


The active segment register is selected according 
to the rules listed in Table 3-3 and the type of 
instruction being currently processed. In general, 
the DS register selector is used for data refer- 
ences. Stack references use the SS register, and 
instruction fetches use the CS register. While some 
of these selections may be overridden, instruction 
fetches, stack operations, and the destination write 
operation of string operations cannot be over- 
ridden. Special segment-override instruction 
prefixes allow the use of alternate segment regis- 
ters. These segment registers include the ES, FS, 
and GS registers. 


3.3.1.3 Instruction Pointer Register 


The Instruction Pointer (EIP) Register contains 
the offset into the current code segment of the next 
instruction to be executed. The register is normally 
incremented by the length of the current instruction 
with each instruction execution unless it is implicitly 
modified through an interrupt, exception, or an 
instruction that changes the sequential execution 
flow (for example JMP and CALL). 


Table 3-3 illustrates the code segment selection 
rules. 


Implied (Default) | Segment-Override 
Type of Memory Reference Segment Prefix 
Code Fetch CS None 
Destination of PUSH, PUSHF, INT, CALL, PUSHA instructions Ss None 
Source of POP, POPA, POPF, IRET, RET instructions Ss None 
Destination of STOS, MOVS, REP STOS, REP MOVS instructions ES None 
Other data references with effective address using base registers of: DS CS, ES, FS, GS, SS 
EAX, EBX, ECX, EDX, ESI, EDI, EBP, ESP 
Ss CS, DS, ES, FS, GS 
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3.3.1.4 Flags Register 


The Flags Register contains status information and referred to as the Flags register that is used when 
controls certain operations on the MediaGX executing 8086 or 80286 code. Table 3-4 gives the 
processor. The lower 16 bits of this register are bit formats for the EFLAGS Register. 


Table 3-4 EFLAGS Register 
Bit Name Flag Type | Description 


31:22 RSVD -- Reserved — Set to 0. 

21 ID System Identification Bit — The ability to set and clear this bit indicates that the CPUID instruction is 

supported. The ID can be modified only if the CPUID bit in CCR4 (Index E8h[7]) is set. 
20:19 RSVD -- Reserved — Set to 0. 

18 AG System Alignment Check Enable — In conjunction with the AM flag in CRO, the AC flag determines 
whether or not misaligned accesses to memory cause a fault. If AC is set, alignment faults are 
enabled. 

17 VM System Virtual 8086 Mode — If set while in protected mode, the processor switches to virtual 8086 


operation handling segment loads as the 8086 does, but generating exception 13 faults on 
privileged opcodes. The VM bit can be set by the IRET instruction (if current privilege level 
is 0) or by task switches at any privilege level. 

16 RF Debug Resume Flag — Used in conjunction with debug register breakpoints. RF is checked at 
instruction boundaries before breakpoint exception processing. If set, any debug fault is 
ignored on the next instruction. 


15 RSVD -- Reserved — Set to 0. 
14 NT System Nested Task — While executing in protected mode, NT indicates that the execution of the 
current task is nested within another task. 
13:12 IOPL System 1/0 Privilege Level — While executing in protected mode, IOPL indicates the maximum cur- 


rent privilege level (CPL) permitted to execute I/O instructions without generating an exception 
13 fault or consulting the I/O permission bit map. IOPL also indicates the maximum CPL allow- 
ing alteration of the IF bit when new values are popped into the EFLAGS register. 

11 OF Arithmetic | Overflow Flag — Set if the operation resulted in a carry or borrow into the sign bit of the result 
but did not result in a carry or borrow out of the high-order bit. Also set if the operation resulted 
in a carry or borrow out of the high-order bit but did not result in a carry or borrow into the sign 

bit of the result. 

10 DF Control Direction Flag — When cleared, DF causes string instructions to auto-increment (default) the 
appropriate index registers (ESI and/or EDI). Setting DF causes auto-decrement of the index 

registers to occur. 


9 IF System Interrupt Enable Flag — When set, maskable interrupts (INTR input pin) are acknowledged 
and serviced by the CPU. 

8 TF Debug Trap Enable Flag — Once set, a single-step interrupt occurs after the next instruction com- 
pletes execution. TF is cleared by the single-step interrupt. 

7 SF Arithmetic | Sign Flag — Set equal to high-order bit of result (0 indicates positive, 1 indicates negative). 

6 ZF Arithmetic | Zero Flag — Set if result is zero; cleared otherwise. 

5 RSVD -- Reserved — Set to 0. 

4 AF Arithmetic | Auxiliary Carry Flag — Set when a carry out of (addition) or borrow into (subtraction) bit posi- 
tion 3 of the result occurs; cleared otherwise. 

3 RSVD = Reserved — Set to 0. 

2 PF Arithmetic | Parity Flag — Set when the low-order 8 bits of the result contain an even number of ones; 
otherwise PF is cleared. 

1 RSVD Reserved — Set to 1. 

0 CF Arithmetic | Carry Flag — Set when a carry out of (addition) or borrow into (subtraction) the most signifi- 


cant bit of the result occurs; cleared otherwise. 
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3.3.2 System Register Set 


The system register set, shown in Table 3-5, 
consists of registers not generally used by applica- 
tion programmers. These registers are typically 
employed by system level programmers who 
generate operating systems and memory manage- 
ment programs. Associated with the system 
register set are certain tables and segments which 
are listed in Table 3-5. 


The Control Registers control certain aspects of 
the MediaGX processor such as paging, copro- 
cessor functions, and segment protection. 


The Descriptor Tables hold descriptors that 
manage memory segments and tables, interrupts 
and task switching. The tables are defined by 
corresponding registers. 


The two Task State Segments Tables defined by 
TSS register are used to save and load the 
computer state when switching tasks. 


The Configuration Registers are used to define 
Cyrix MediaGX CPU setup including cache 
management. 


The ID registers allow BIOS and other software to 
identify the specific CPU and stepping. System 
Management Mode (SMM) control information is 
stored in the SMM registers. 


The Debug Registers provide debugging facilities 
for the MediaGX processor and enable the use of 
data access breakpoints and code execution 
breakpoints. 


The Test Registers provide a mechanism to test 
the contents of both the on-chip 16KB cache and 
the Translation Lookaside Buffer (TLB). The TLB is 
used as a cache for the tables that are used in to 
translate linear addresses to physical addresses 
while paging is enabled. 


Table 3-5 lists the system register sets along with 
their size and function. 


Table 3-5 System Register Set 


Register Sets 


Width 
Group Name Function (Bits) 
Control CRO System Control 32 
Registers Register 
CR2 Page Fault Linear 32 
Address Register 
CR3 Page Directory Base Reg- 32 
ister 
CR4 Time Stamp Counter 32 
Descriptor GDT General Descriptor Table 32 
Tables IDT Interrupt Descriptor Table 32 
LDT Local Descriptor Table 16 
Descriptor GDTR GDT Register 32 
ee IDTR IDT Register 32 
sters 
mics LDTR _| LDT Register 16 
Task State TSS Task State Segment 16 
Segment and Tables 
Registers TR TSS Register Setup 16 
Configuration | CCRn Configuration Control 8 
Registers Registers 
ID DIRn Device Identification 8 
Registers Registers 
SMM SMARn SMM Address Region 8 
Registers Registers 
SMHRn_ | SMM Header Addresses 8 
Performance | PCRO Performance Control 8 
Registers Register 
Debug DRO Linear Breakpoint 32 
Registers Address 0 
DR1 Linear Breakpoint 32 
Address 1 
DR2 Linear Breakpoint 32 
Address 2 
DR3 Linear Breakpoint 32 
Address 3 
DR6 Breakpoint Status 32 
DR7 Breakpoint Control 32 
Test TR3 Cache Test 32 
Registers TR4 Cache Test 32 
TR5 Cache Test 32 
TR6 TLB Test Control 32 
TR7 TLB Test Status 32 
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3.3.2.1 Control Registers state of the CPU. The lower 16 bits of CRO are 
A map of the Control Registers (CRO, CR2, CR3, referred to as the Machine Status Word (MSW). 
and CR&4) is shown in Table 3-6 and the bit defini- 
tions given in Table 3-7. (These registers should not 
be confused with the CRRn registers.) The CRO 
register contains system control bits which 
configure operating modes and indicate the general 


When operating in real mode, any program can read 
and write the control registers. In protected mode, 
however, only privilege level 0 (most-privileged) 
programs can read and write these registers. 


Table 3-6 Control Registers Map 


31 | 30 | 29 | 28 | 27 | 26 | 25 | 24 | 23 | 22 | 21 | 20/19/18) 17) 16)15)14/13/12)/11)10;9)}8)}7/6/5)4})3/;2/1)0 


CR4 Register 


RSVD T | RSVD 
iS) 
Cc 
CR3 Register 
PDBR (Page Directory Base Register) RSVD 0|;0 RSVD 


CR2 Register 


PFLA (Page Fault Linear Address) 
CRi Register 


RSVD 
Ga Tin inet 
CRO Register 
P|CIN RSVD A|R|W RSVD N/}1]/T]E}M/P 
G|D/W M|S|/P E S|M|P/E 

Vv 
D 
Machine Status Word (MSW) 
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Table 3-7 CR4-CRO Bit Definitions 


Bit Name | Description 


CR4 Register 


31:3. | RSVD_ | Reserved: Set to 0 (always returns 0 when read). 


2 TSC Time Stamp Counter Instruction: 
If = 1 RDTSC instruction enabled for CPL = 0 only; reset state. 
If = 0 RDTSC instruction enabled for all CPL states. 


1:0 | RSVD_ | Reserved — Set to 0 (always returns 0 when read). 


CR3 Register 


31:12 | PDBR_ | Page Directory Base Register: Identifies page directory base address on a 4KB page boundary. 
11:0 | RSVD_ | Reserved: Set to 0. 


CR2 Register 


31:0 | PFLA Page Fault Linear Address: With paging enabled and after a page fault, PFLA contains the linear address of 
the address that caused the page fault. 


CRO Register 


31 PG Paging Enable Bit: If PG = 1 and protected mode is enabled (PE = 1), paging is enabled. After changing the 
state of PG, software must execute an unconditional branch instruction (e.g., JMP, CALL) to have the change 
take effect. 

30 CD Cache Disable: If CD = 1, no further cache line fills occur. However, data already present in the cache continues 


to be used if the requested address hits in the cache. Writes continue to update the cache and cache invalida- 
tions due to inquiry cycles occur normally. The cache must also be invalidated to completely disable any cache 
activity. 

29 NW _ | Not Write-Through: If NW = 1, the on-chip cache operates in write-back mode. In write-back mode, writes are 
issued to the external bus only for a cache miss, a line replacement of a modified line, execution of a locked 
instruction, or a line eviction as the result of a flush cycle. If NW = 0, the on-chip cache operates in write-through 
mode. In write-through mode, all writes (including cache hits) are issued to the external bus. This bit cannot be 
changed if LOCK_NW = 1 in CCR2. 


18 AM Alignment Check Mask: If AM = 1, the AC bit in the EFLAGS register is unmasked and allowed to enable align- 
ment check faults. Setting AM = 0 prevents AC faults from occurring. 

16 WP Write Protect: Protects read-only pages from supervisor write access. WP = 0 allows a read-only page to be 
written from privilege level 0-2. WP = 1 forces a fault on a write to a read-only page from any privilege level. 

5 NE Numerics Exception: NE = 1 to allow FPU exceptions to be handled by interrupt 16. NE = 0 if FPU exceptions 
are to be handled by external interrupts. 

1 Reserved: Do not attempt to modify. 

3 TS Task Switched: Set whenever a task switch operation is performed. Execution of a floating point instruction with 
TS = 1 causes a DNA fault. If MP = 1 and TS = 1, a WAIT instruction also causes a DNA fault. 

2 EM Emulate Processor Extension: If EM = 1, all floating point instructions cause a DNA fault 7. 

1 MP Monitor Processor Extension: If MP = 1 and TS = 1, a WAIT instruction causes Device Not Available (DNA) 


fault 7. The TS bit is set to 1 on task switches by the CPU. Floating point instructions are not affected by the 
state of the MP bit. The MP bit should be set to one during normal operations. 


0 PE Protected Mode Enable: Enables the segment based protection mechanism. If PE = 1, protected mode is 
enabled. If PE = 0, the CPU operates in real mode and addresses are formed as in an 8086-style CPU. Refer to 
Section 3.13 “Protection” on page 97. 
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Table 3-8 Effects of Various Combinations of EM, TS, and MP Bits 


CRO[3:1] Instruction Type 
TS EM MP WAIT ESC 
0 0 0 Execute Execute 
0 0 1 Execute Execute 
1 0 0 Execute Fault 7 
1 0 1 Fault 7 Fault 7 
0 1 0 Execute Fault 7 
0 1 1 Execute Fault 7 
1 1 0 Execute Fault 7 
1 1 1 Fault 7 Fault 7 
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3.3.2.2 Configuration Registers 


The configuration registers listed in Table 3-9 are 
CPU registers and are selected by register index 
numbers. The registers are accessed through I/O 
memory locations 22h and 23h. Registers are 
selected for access by writing an index number to 
I/O Port 22h using an OUT instruction prior to 
transferring data through I/O Port 23h. 


Each data transfer through I/O Port 23h must be 
preceded by a register index selection through I/O 


Port 22h; otherwise, subsequent I/O Port 23h oper- 
ations are directed off-chip and produce external 
I/O cycles. 


If MAPEN, bit 4 of CCR3 (Index C3h[4]) = 0, 
external I/O cycles will occur if the register index 
number is outside the range COh-CFh, FEh, and 
FFh. The MAPEN bit should remain 0 during 
normal operation to allow system registers located 
at I/O Port 22h to be accessed (see Table 3-11 on 
page 53). 


Table 3-9 Configuration Register Summary 
Access Default Reference 

Index Type Name Controlled By* Value (Bit Formats) 

Cih R/W CCR1 — Configuration Control 1 SMI_LOCK 00h Table 3-11 on page 52 
C2h R/W CCR2 — Configuration Control 2 -- 00h Table 3-11 on page 52 
C3h R/W CCR3 — Configuration Control 3 SMI_LOCK 00h Table 3-11 on page 53 
E8h R/W CCR4 — Configuration Control 4 MAPEN 85h Table 3-11 on page 54 
EBh R/W CCR7 — Configuration Control 7 -- 00h Table 3-11 on page 54 
20h R/W PCR — Performance Control MAPEN 07h Table 3-11 on page 54 
BOh R/W SMHRO — SMM Header Address 0 MAPEN xxh Table 3-11 on page 55 
Bih R/W SMHR1 — SMM Header Address 1 MAPEN xxh Table 3-11 on page 55 
B2h R/W SMHR2 — SMM Header Address 2 MAPEN xxh Table 3-11 on page 55 
B3h R/W SMHR3 — SMM Header Address 3 MAPEN xxh Table 3-11 on page 55 
B8h R/W GCR — Graphics Control Register MAPEN 00h Table 4-1 on page 104 
B9h VGACTL — VGA Control Register -- 00h Table 5-5 on page 200 
BAh-BDh VGAM0 — VGA Mask Register -- 00h Table 5-5 on page 200 
CDh R/W SMARO — SMM Address 0 SMI_LOCK 00h Table 3-11 on page 55 
CEh R/W SMAR1 — SMM Address 1 SMI_LOCK 00h Table 3-11 on page 55 
CFh R/W SMAR2 — SMM Address 2 SMI_LOCK 00h Table 3-11 on page 55 
FEh RO DIRO — Device ID 0 -- 4xh Table 3-11 on page 56 
FFh RO DIR1 — Device ID 1 -- xxh Table 3-11 on page 56 


*Note: MAPEN = Index C3h[4] (CCR3) and SMI_LOCK = Index C3h[0] (CCR3). 
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Table 3-10 Configuration Register Map 


Register 
(Index) Bit 7 Bit 6 Bit 5 Bit 4 Bit 3 Bit 2 Bit 1 Bit 0 


Control Registers 


CCR1 (Cth) RSVD SMAC USE_SMI RSVD 
CCR2 (C2h) USE_SUSP RSVD WT1 SUSP_HLT | LOCK_NW RSVD 

CCR (C3h) LSS_34 LSS_23 LSS_12 MAPEN RSVD NMI_EN SMI_LOCK 
CCR4 (E8h) CPUID SMI_NEST RSVD DTE_EN MEM_BYP IORT2 IORT1 IORTO 
CCR7 (EBh) RSVD NMI RSVD EMMX 
PCR (20h) LSSER RSVD 


ee SS SS ——————————————————————————————————— a 
Device ID Registers 


DIRO (FEh) DID3 DID2 DID1 DIDO RSVD CLKMODE1 RSVD CLMODEO 
DIR1 (FFh) SID3 SID2 SID1 SIDO RID3 RID2 RID1 RIDO 
SMM Base Header Address Registers 

SMARO (CDh) A31 A30 A29 A28 A27 A26 A25 A24 
SMAR1 (CEh) A23 A22 A21 A20 A19 A18 A17 A16 
SMAR2 (CFh) A15 A14 A13 Al12 SIZE3 SIZE2 SIZE1 SIZEO 
SMHRO (BOh) A7 A6 A5 A4 A3 A2 Al AO 
SMHRI1 (Bt1h) A15 A14 A13 A12 Alt A10 AQ A8 
SMHR2 (B2h) A23 A22 A21 A20 A19 A18 A17 A16 
SMHR3 (B3h) A31 A30 A29 A28 A27 A26 A26 A24 


aR eae a a Ec cS rf PO LY 
Graphics/VGA Related Registers 


GCR (B8h) RSVD Scratchpad Size Base Address Code 

VGACTL RSVD Enable SMI | Enable SMI | Enable SMI 

(B9h) for VGA for VGA for VGA 
memory memory memory 

B8000h to B0000h to A0000h to 

BFFFFh B7FFFh AFFFFh 

VGAMO (BAh) VGA Mask Register Bits [7:0] 

VGAM1 (BBh) VGA Mask Register Bits [15:8] 

VGAM2 (BCh) VGA Mask Register Bits [23:16] 

VGAMS (BDh) VGA Mask Register Bits [31:24] 
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Table 3-11 Configuration Registers 


Bit Name Description 
Index Cth CCR1 — Configuration Control Register 1 (R/W) Default Value = 00h 
7:3 RSVD Reserved: Set to 0. 
2 SMAC System Management Memory Access: 
If = 1: SMINT instruction can be recognized (see Table 3-33 on page 88). 
If = 0: SMINT instruction has no affect. 
Note: SMI_LOCK (CCR3[0]) must = 0, or the CPU must be in SMI mode, to write this bit. 
1 USE_SMI Enable SMM Pins: 
If = 1: SMI# input pin is enabled (see Table 3-33 on page 88). SMINT instruction can be recognized. 
If = 0: SMI# pin is ignored. 
Note: SMI_LOCK (CCR3[0]) must = 0, or the CPU must be in SMI mode, to write this bit. 
0 RSVD Reserved — Set to 0. 


Note: Bits 1 and 2 are cleared to zero at reset. 


Index C2h CCR2 — Configuration Control Register 2 (R/W) Default Value = 00h 
7 USE_SUSP | Enable Suspend Pins: 
If = 1: SUSP# input and SUSPA# output are enabled. 
If = 0: SUSP# input is ignored and SUSPA# output floats. 
6:5 RSVD Reserved: Set to 0. 
4 WT1 Write-Through Region 1: 
If = 1: Forces all writes to the address region between 640KB to 1MB that hit in the on-chip cache to be 
issued on the external bus. 
3 SUSP_HLT | Suspend on HALT: 
If = 1: CPU enters suspend mode following execution of a HALT instruction. 
2 LOCK_NW_ | Lock NW Bit: 
If = 1: Prohibits changing the state of the NW bit (CRO[29]) (refer to Table 3-7 on page 48). 
Set to 1 after setting NW. 
1:0 RSVD Reserved: Set to 0. 


Note: All bits are cleared to zero at reset. 
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Table 3-11 Configuration Registers (cont.) 


Bit Name Description 
Index C3h CCR3 — Configuration Control Register 3 (R/W) Default Value = 00h 
7 LSS_34 Load/Store Serialize 3 GBytes to 4 GBytes: 
If = 1: Strong R/W ordering imposed in address range C000 0000h to FFFF FFFFh: 
6 LSS_23 Load/Store Serialize 2 GBytes to 3 GBytes: 
If = 1: Strong R/W ordering imposed in address range 8000 0000h to BFFF FFFFh: 
5 LSS_12 Load/Store Serialize 1 GByte to 2 GBytes: 
If = 1: Strong R/W ordering imposed in address range 4000 0000h to 7FFF FFFFh 
4 MAPEN Map Enable: 


If = 1: All configuration registers are accessible. All accesses to Port 22h are trapped. 
If = 0: Only configuration registers Index C1h through CFh, FEh, FFh (CCRn, SMAR, DIRn) are acces- 
sible. Other configuration registers (including PCR, SMHRn, GCR, VGACTL, VGAMO) are not acces- 


sible. 
3:2 RSVD Reserved: Set to 0. 
1 NMI_EN NMI Enable: 


If = 1: NMI is enabled during SMM. 
If = 0: NMI is not recognized during SMM. 


Note: SMI_LOCK (CCR3(0]) must = 0 or the CPU must be in SMI mode to write to this bit. 
0 SMI_LOCK | SMM Register Lock: 


If = 1: SMM Address Region Register (SMAR[31:0]), SMAC (CCR1[2]), USE_SMI (CCR1[1]) 
cannot be modified unless in SMM routine. Once set, SMI_LOCK can only be cleared by asserting the 
RESET pin. 


Note: All bits are cleared to zero at reset. 
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Table 3-11 Configuration Registers (cont.) 


Bit Name Description 
Index E8h CCR4 — Configuration Control Register 4 (R/W) Default Value = 85h 
7 CPUID Enable CPUID Instruction: 


If = 1: The ID bit in the EFLAGS register to be modified and execution of the CPUID instruction occurs 
as documented in Table 9-2 "Instruction Fields" on page 234. 


If = 0: The ID bit can not be modified and execution of the CPUID instruction causes an invalid opcode 
exception. 

6 SMI_NEST | SMI Nest: 

If = 1: SMI interrupts can occur during SMM mode. SMI handlers can optionally set SMI_NEST high to 
allow higher-priority SMI interrupts while handling the current event 

5 RSVD Reserved — Set to 0. 

DTE_EN Directory Table Entry Cache: 

If = 1: Enables directory table entry to be cached. 

Cleared to 0 at reset. 

3 MEM_BYP_ | Memory Read Bypassing: 

If = 1: Enables memory read bypassing. 

Cleared to 0 at reset. 


2:0 IORT(2:0) I/O Recovery Time: Specifies the minimum number of bus clocks between I/O accesses: 
000 = No clock delay 100 = 16-clock delay 
001 = 2-clock delay 101 = 32-clock delay (default value after reset) 
010 = 4-clock delay 110 = 64-clock delay 
011 = 8-clock delay 111 = 128-clock delay 


Cleared to 0 at reset. 
Note: MAPEN (CCR9[4]) must = 1 to read or write to this register. 


Index EBh CCR7 — Configuration Control Register 7 (R/W) Default Value = 00h 
7:3 RSVD Reserved: Set to 0. 
2 NMI NMI Enable: 
If = 1: Non-maskable Interrupts (NMIs) are acknowledged. 
1 RSVD Reserved: Set to 0. 
0 EMMX Cyrix Extended MMxX Instructions Enable: 
If = 1: Cyrix extended MMxX instructions are enabled 


Index 20h PCR — Performance Control Register (R/W) Default Value = 07h 


7 LSSER Load/Store Serialize Enable (Reorder Disable): LSSER should be set to ensure that memory- 
mapped I/O devices operating outside of the address range 640K to 1M will operate correctly. For 
memory accesses above 1 GByte, refer to CCR3[7:5] (LSS_34, LSS_23, LSS_ 12.) 


If =1: All memory read and write operations will occur in execution order (load/store serializing 
enabled, reordering disabled). 


If =O: Memory reads and write can be reordered for optimum performance (load/store serializing 
disabled, reordering enabled). 


Memory accesses in the address range 640K to 1M will always be issued in execution order. 
6:0 RSVD Reserved — Set to 0. 
Note: MAPEN (CCR9[4]) must = 1 to read or write to this register. 
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Table 3-11 Configuration Registers (cont.) 


Bit 


Name 


Description 


Index BOh, B1h, B2h, B3h 


SMHR — SMI Header Address Register (R/W) Default Value = xxh 


Index SMHR Bits | SMM Header Address Bits [31:0]: SMHR address bits [31:0] contain the physical base address for 
B3h [31:24] the SMM header space: For example, bits [31:24] correspond with Index B3h 

Boh [23:16] Refer to Section 3.11.4 “SMM Configuration Registers” on page 89 for more information. 

Bih [15:12] 

Boh [7:0] 


Note: MAPEN (CCR9[4]) must = 1 to read or write to this register. 


Index CDh, CEh, CFh 


SMAR — SMM Address Region/Size Register (R/W) Default Value = 00h 


Index SMAR Bits | SMM Address Region Bits, (SMAR [A31:A12]) — SMAR address bits [31:12] contain the base 
CDh [31:24] address for the SMM region. 
CEh [23:16] Bits [31:24] correspond with Index CDh 
CFh{7:4] [15:12] Bits [23:16] correspond with Index CEh 
Bits [15:12] correspond with Index CFh[7:4] 
Index CFh allows simultaneous access to SMAR address regions bits SMAR[15:12] and size code bits 
SIZE[3:0]. During access, the upper 4-bits of Port 23h hold SMAR[15:12]. 
Refer to Section 3.11.4 “SMM Configuration Registers” on page 89 for more information. 
CFh[3:0] SIZE[3:0] SMM Region Size Bits, (SIZE [3:0]) — SIZE address bits contain the size code for the SMM region. 


During access the lower 4-bits of port 23 hold SIZE[3:0]. Index CFh allows simultaneous access to 
SMAR address regions bits SMAR[15:12] (See above) and size code bits SIZE[3:0]. 


0000 = SMM Disabled 0100 = 32KB 1000 = 512KB 1100 = 8MB 

0001 = 4KB 0101 = 64KB 1001 = 1MB 1101 = 16MB 

0010 = 8KB 0110 = 128KB 1010 = 2MB 1110 = 32MB 

0011 = 16KB 0111 = 256KB 1011 = 4MB 1111 = 4KB (same as 0001) 


Note: SMI_LOCK (CCR3(0]) must = 0, or the CPU must be in SMI mode, to write these registers/bits. 
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Table 3-11 Configuration Registers (cont.) 


Bit Name Description 

Index FEh DIRO — Device Identification Register 0 Default Value = 4xh 
7:4 DID[3:0] Device ID (Read Only) — Identifies device as MediaGX MMX-Enhanced processor. 
3:0 MULT[3:0] Core Multiplier (Read Only) — Identifies the core multiplier set by the CLKMODE[2:0] pins (see sig- 


nal descriptions page 21) 

If DIR1 (Index FFh) is 30h-4Fh then MULT[3:0]: 
0000 = SYSCLK multiplied by 4 (Test mode only) 
0001 = SYSCLK multiplied by 6 

0010 = SYSCLK multiplied by 4 (Test mode only) 
0011 = SYSCLK multiplied by 6 

0100 = SYSCLK multiplied by 7 

0101 = SYSCLK multiplied by 8 

0110 = SYSCLK multiplied by 7 

0111 = SYSCLK multiplied by 5 

1xxx = Reserved 

If DIR1 (Index FFh) is 50h or greater then MULT[3:0]: 
0000 = SYSCLK multiplied by 4 (Test mode only) 
0001 = SYSCLK multiplied by 10 

0010 = SYSCLK multiplied by 4 (Test mode only) 
0011 = SYSCLK multiplied by 6 

0100 = SYSCLK multiplied by 9 

0101 = SYSCLK multiplied by 5 

0110 = SYSCLK multiplied by 7 

0111 = SYSCLK multiplied by 8 

1xxx = Reserved 


Index FFh DIR1 -- Device Identification Register 1 Default Value = xxh 


7:0 DIR1 Device Identification Revision (Read Only) — DIR1 indicates device revision number. 
If DIR1 is 30h-33h = MediaGX MMX-Enhanced processor revision 1.0-2.3 

If DIR1 is 34h-4Fh = MediaGX MMX-Enhanced processor revision 2.4-3.x 

If DIR1 is 50h or greater = MediaGX MMX-Enhanced processor revision 4.0 and up. 
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3.3.2.3. Debug Registers 


Six debug registers (DRO-DR3, DR6 and DR7) 
support debugging on the MediaGX processor. 
Memory addresses loaded in the debug registers, 
referred to as “breakpoints,” generate a debug 
exception when a memory access of the specified 
type occurs to the specified address. A breakpoint 
can be specified for a particular kind of memory 
access such as a read or write operation. Code 
and data breakpoints can also be set allowing 
debug exceptions to occur whenever a given data 
access (read or write operation) or code access 
(execute) occurs. The size of the debug target can 
be set to 1, 2, or 4 bytes. The debug registers are 
accessed through MOV instructions that can be 
executed only at privilege level 0 (real mode is 
always privilege level 0). 


Table 3-12 Debug Registers 


The Debug Address Registers (DRO-DR3) each 
contains the linear address for one of four possible 
breakpoints. Each breakpoint is further specified by 
bits in the Debug Control Register (DR7). For each 
breakpoint address in DRO-DR39, there are corre- 
sponding fields L, R/W, and LEN in DR7 that 
specify the type of memory access associated with 
the breakpoint. 


The R/W field can be used to specify instruction 
execution as well as data access breakpoints. 
Instruction execution breakpoints are always taken 
before execution of the instruction that matches the 
breakpoint. The Debug Registers are mapped in 
Table 3-12 


DR6 Register 


31 | 30 | 29 | 28 | 27 | 26 | 25 | 24 | 23 | 22 | 21 | 20) 19) 18/17) 16) 15/14/13)12/11/10;9)/8;/7;6)/5)/4/3},2)1)0 
DR7 Register 
LEN3 | R/W3 | LEN2 | R/W2 | LEN1 | R/W1 | LENO | R/WO LO 


DR3 Register 


DR2 Register 


Breakpoint 3 Linear Address 


DR1 Register 


Breakpoint 2 Linear Address 


DRO Register 


Breakpoint 1 Linear Address 


Breakpoint 0 Linear Address 


Note: All bits marked as 0 or 1 are reserved and should not be modified. 
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The Debug Status Register (DR6) reflects condi- 
tions that were in effect at the time the debug 
exception occurred. The contents of the DR6 
register are not automatically cleared by the 
processor after a debug exception occurs, and 
therefore should be cleared by software at the 
appropriate time. Table 3-13 lists the field definitions 
for the DR6 and DR7 registers. 


Table 3-13 DR7 and DR6 Bit Definitions 


Register Sets 


Code execution breakpoints may also be gener- 
ated by placing the breakpoint instruction (INT3) at 
the location where control is to be regained. The 
single-step feature may be enabled by setting the 
TF flag (bit 8) in the EFLAGS register. This causes 
the processor to perform a debug exception after 
the execution of every instruction. Debug Registers 
6 and 7 are shown in Table 3-13. 


Number 
Field(s) of Bits Description 

DR7 Register 

R/Wn 2 Applies to the DRn breakpoint address register: 
00 = Break on instruction execution only 
01 = Break on data write operations only 
10 = Not used 
11 = Break on data reads or write operations. 

LENn 2 Applies to the DRn breakpoint address register: 
00 = One-byte length 
01 = Two-byte length 
10 = Not used 
11 = Four-byte length. 

Gn 1 If = 1: breakpoint in DRn is globally enabled for all tasks and is not cleared by the processor as the 
result of a task switch. 

Ln 1 If = 1: breakpoint in DRn is locally enabled for the current task and is cleared by the processor as 
the result of a task switch. 

GD 1 Global disable of debug register access. GD bit is cleared whenever a debug exception occurs. 

DR6 Register 

Bn 1 Bn is set by the processor if the conditions described by DRn, R/Wn, and LENn occurred when the 
debug exception occurred, even if the breakpoint is not enabled via the Gn or Ln bits. 

BT 1 BT is set by the processor before entering the debug handler if a task switch has occurred to a task 
with the T bit in the TSS set. 

BS 1 BS is set by the processor if the debug exception was triggered by the single-step execution mode 
(TF flag, bit 8, in EFLAGS set). 

Note: n=0,1,2,and3 
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3.3.2.4 Test Registers 


The five test registers are used in testing the 
CPU’s Translation Lookaside Buffer (TLB) and on- 
chip cache. TR6 and TR7 are used for TLB testing, 
and TR3-TR5 are used for cache testing. Table 3-14 
is a register map for the Test Registers with their bit 
definitions given in Tables 3-15 and 3-16. 


TLB Test Registers 

The CPU TLB is a 32-entry, four-way set associa- 
tive memory. Each TLB entry consists of a 24-bit 
tag and 20-bit data. The 24-bit tag represents the 
high-order 20 bits of the linear address, a valid bit, 
and three attribute bits. The 20-bit data portion 
represents the upper 20 bits of the physical 
address that corresponds to the linear address. 


Table 3-14 Test Registers 


The TLB Test Data Register (TR7) contains the 
upper 20 bits of the physical address (TLB data 
field), three LRU bits and a control bit. During TLB 
write operations, the physical address in TR7 is 
written into the TLB entry selected by the contents 
of TR6. During TLB lookup operations, the TLB 
data selected by the contents of TR6 is loaded into 
TR7. Table 3-15 lists the bit definitions for TR7 and 
TR6. 


The TLB Test Control Register (TR6) contains a 
command bit, the upper 20 bits of a linear address, 
a valid bit and the attribute bits used in the test 
operation. The contents of TR6 are used to create 
the 24-bit TLB tag during both write and read (TLB 
lookup) test operations. The command bit defines 
whether the test operation is a read or a write. 


TR7 Register 


31 | 30 | 29 | 28 | 27 | 26 | 25 | 24 | 23 | 22 | 21 | 20/19) 18) 17) 16)15)14/13/12)/11)10;9|/8)}7/;6/5)4})3/2/1)0 


Physical Address 
TR6 Register 


0 | 0] TLBLRU | 0 | 0/PL| REP | 0/0 


Linear Address 


TR5 Register 


V | D |D#| U |U#} R |R#}/0}0;/0/0]C 


RSVD 


TR4 Register 


Line Selection Set/ CTL 


Cache Tag Address 


TR3 Register 


0|V Cache 
LRU Bits 


Dirty Bis |0]0]0 


Cache Data 
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Table 3-15 TR7-TR6 Bit Definitions 


Bit Name Description 


TR7 Register 


31:12 Physical Physical Address: 
Address —_| TLB lookup: Data field from the TLB. 
TLB write: Data field written into the TLB. 
11:10 RSVD Reserved: Set to 0. 
9:7 TLB LRU LRU Bits: 
TLB lookup: LRU bits associated with the TLB entry before the TLB lookup. 
TLB write: Ignored. 
4 PL PL Bit: 
TLB lookup: If PL = 1, read hit occurred. If PL = 0, read miss occurred. 


TLB write: If PL = 1, REP field is used to select the set. If PL = 0, the pseudo-LRU replacement algo- 
rithm is used to select the set. 


3:2 REP Set Selection: 

TLB lookup: If PL = 1, this field indicates the set in which the tag was found. If PL = 0, undefined data. 
TLB write: If PL = 1, this field selects one of the four sets for replacement. If PL = 0, ignored. 

1:0 RSVD Reserved: Set to 0. 


TR6 Register 


31:12 Linear Linear Address: 


Address —_| TLB lookup: The TLB is interrogated per this address. If one and only one match occurs in the TLB, 
the rest of the fields in TR6 and TR7 are updated per the matching TLB entry. 


TLB write: A TLB entry is allocated to this linear address. 


11 Vv Valid Bit: 
TLB write: If V = 1, the TLB entry contains valid data. If V = 0, target entry is invalidated. 
10:9 D, D# Dirty Attribute Bit and its Complement (D, D#) 
8:7 U, U# User/Supervisor Attribute Bit and its Complement (U, U#) 
6:5 R, R# Read/Write Attribute Bit and its Complement (R, R#) 
Effect on TLB Lookup Effect on TLB Write 
00 = Do not match Undefined 
01 = Match if D, U, or R bit is a0 Clear the bit 
10= Match if D, U, or R bitis a 1 Set the bit 
11= Match if D, U, or R bit is either a 1 or 0 Undefined 
4:1 RSVD Reserved: Set to 0. 
0 Cc Command Bit: 


If C = 1: TLB lookup. 
If C = 0: TLB write. 
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Cache Test Registers 


The CPU’s 16KB on-chip cache is a four-way set 
associative memory that is configured as write- 
back cache. Each cache set contains 256 entries. 
Each entry consists of a 20-bit tag address, a 16- 
byte data field, a valid bit, and four dirty bits. 


The 20-bit tag represents the high-order 20 bits of 
the physical address. The 16-byte data represents 
the 16 bytes of data currently in memory at the 
physical address represented by the tag. The valid 
bit indicates whether the data bytes in the cache 
actually contain valid data. The four dirty bits indi- 
cate if the data bytes in the cache have been modi- 
fied internally without updating external memory 
(write-back configuration). Each dirty bit indicates 


the status for one double-word (4 bytes) within the 
16-byte data field. 


For each line in the cache, there are three LRU bits 
that indicate which of the four sets was most 
recently accessed. A line is selected using bits 
[11:4] of the physical address. Figure 3-2 illustrates 
the CPU cache architecture. 


The CPU contains three test registers (TR5-TR3) 
that allow testing of its internal cache. Bit defini- 
tions for the cache test registers are shown in 
Table 3-16. Using a 16-byte cache fill buffer and a 
16-byte cache flush buffer, cache reads and writes 
may be performed. 


Figure 3-1 illustrates how the internal cache archi- 
tecture works. 


WN = cache Entry (153 bits) 
Tag Address (20 bits) 
Data (128 bits) 
Valid Status (1 bit) 
Dirty Status (4 bits) 


Line 
Address Set 0 Set 1 Set 2 Set 3 LRU 
D 255 ————7 
E 254 
AIt-A4, : 
: 0 
152 ---0 152 ---0 152 ---0 152 ---0 2---0 


Figure 3-1 CPU Cache Architecture 
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Table 3-16 TR5-TR3 Bit Definitions 


Bit 


Name 


Description 


TR5 Regis 


ter 


11:4 


Line Selec- 
tion 


Line Selection: 
Physical address bits 11-4 used to select one of 256 lines. 


3:2 


Set/DWord 
Selection 


Set/DWord Selection: 


Cache read: Selects which of the four sets in the cache is used as the source for data 
transferred to the cache flush buffer. 


Cache write: Selects which of the four sets in the cache is used as the destination for data transferred 
from the cache fill buffer. 


Flush buffer read: Selects which of the four Dword in the flush buffer is 
used during a TR3 read. 


Fill buffer write: Selects which of the four Dword in the fill buffer is written during a TR3 write. 


1:0 


Control Bits 


Control Bits: 

If = 00: flush read or fill buffer write. 
If = 01: cache write. 

If = 10: cache read. 

If = 11: cache flush. 


TR4 Regis 
31:12 


ter 


Upper Tag 
Address 


Upper Tag Address: 
Cache read: Upper 20 bits of tag address of the selected entry. 
Cache write: Data written into the upper 20 bits of the tag address of the selected entry. 


Valid Bit 


Valid Bit: 
Cache read: Valid bit for the selected entry. 
Cache write: Data written into the valid bit for the selected entry. 


9:7 


LRU Bits 


LRU Bits: 


Cache read: The LRU bits for the selected line. 

xx1 = Set 0 or Set 1 most recently accessed. 

xx0 = Set 2 or Set 3 most recently accessed. 

x1x = Most recent access to Set 0 or Set 1 was to Set 0. 
x0x = Most recent access to Set 0 or Set 1 was to Set 1. 
1xx = Most recent access to Set 2 or Set 3 was to Set 2. 
Oxx = Most recent access to Set 2 or Set 3 was to Set 3. 


Cache write: Ignored. 


6:3 


Dirty Bits 


Dirty Bits: 
Cache read: The dirty bits for the selected entry (one bit per DWord). 
Cache write: Data written into the dirty bits for the selected entry. 


2:0 


RSVD 


Reserved: Set to 0. 


TR3 Regis 


ter 


31:0 


Cache Data 


Cache Data: 
Flush buffer read: Data accessed from the cache flush buffer. 
Fill buffer write: Data to be written into the cache fill buffer. 
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There are five types of test operations that can be 
executed: 


Flush buffer read 
Fill buffer write 

* Cache write 

* Cache read 
Cache flush 


Each of these operations is described in detail in 
Table 3-17. To fill a cache line with data, the fill 


Table 3-17 Cache Test Operations 


buffer must be written four times. Once the fill 
buffer holds a complete cache line of data (16 
bytes), a cache write operation transfers the data 
from the fill buffer to the cache. 


To read the contents of a cache line, cache read 
operation transfers the data in the selected cache 
line to the flush buffer. Once the flush buffer is 
loaded, the programmer accesses the contents of 
the flush buffer by executing four flush buffer read 
operations. 


Test Operation Code Sequence Action Taken 

Flush Buffer Read MOV TRS, Oh Set DWORD = 0, control = 00 = flush buffer read. 
MOV dest,TR3 Flush buffer (31:0) --> dest. 
MOV TRS, 4h Set DWORD = 1, control = 00 = flush buffer read. 
MOV dest,TR3 Flush buffer (63:32) --> dest. 
MOV TRS, 8h Set DWORD = 2, control = 00 = flush buffer read. 
MOV dest,TR3 Flush buffer (95:64) --> dest. 
MOV TR85, Ch Set DWORD = 3, control = 00 = flush buffer read. 
MOV dest,TR3 Flush buffer (127:96) --> dest. 

Fill Buffer Write MOV TRS, Oh Set DWORD = 0, control = 00 = fill buffer write. 
MOV TR3, cache_data Cache_data --> fill buffer (31:0). 
MOV TRS, 4h Set DWORD = 1, control = 00 = fill buffer write. 
MOV TR3, cache_data Cache_data --> fill buffer (63:32). 
MOV TRS, 8h Set DWORD = 2, control = 00 = fill buffer write. 
MOV TR3, cache_data Cache_data --> fill buffer (95:64). 
MOV TRS, Ch Set DWORD = 3, control = 00 = fill buffer write. 
MOV TR3, cache_data Cache_data --> fill buffer (127:96). 

Cache Write MOV TR4, cache_tag Cache_tag --> tag address, valid and dirty bits. 
MOV TR8, line+set+control=01 Fill buffer (127:0) --> cache line (127:0). 

Cache Read MOV TRS, line+set+control=10 Cache line (127:0) --> flush buffer (127:0). 
MOV dest, TR4 Cache line tag address, valid/LRU/dirty bits --> dest. 

Cache Flush MOV TRS, 3h Control = 11 = cache flush, all cache valid bits = 0. 

Sa SSS Es 
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3.3.3 Model Specific Register 


The model specific register (MSR) set is used to 
monitor the performance of the processor or a 
specific component within the processor. 


A MSR register can be read using the RDMSR 
instruction, opcode OF32h. During a MSR register 
read, the contents of the particular MSR register, 
specified by the ECX register, is loaded into the 
EDX:EAX registers. 


A MSR register can be written using the WRMSR 
instruction, opcode OF30h. During a MSR register 
write, the contents of EX:EAX are loaded into the 
MSR register specified in the ECX register. 


The RDMSR and WRMSR instructions are privi- 
leged instructions. 


The MediaGX MMX-Enhanced processor contains 
one 64-bit model specific register (MSR10) the 
Time Stamp Counter (TSC). 


3.3.4 Time Stamp Counter 


The processor contains a model specific register 
(MSR) called the Time Stamp Counter (TSC). The 
TSC, (MSR[10]), is a 64-bit counter that counts the 
internal CPU clock cycles since the last reset. The 
TSC uses a continuous CPU core clock and will 
continue to count clock cycles even when the 
processor is in suspend or shutdown mode. 


The TSC is read using a RDMSR instruction, 
opcode OF 32h, with the ECX register set to 10h. 
During a TSC read, the contents of the TSC 
register is loaded into the EDX:EAX registers. 


The TSC is written to using a WRMSR instruction, 
opcode OF 30h with the ECX register set to 10h. 
During a TSC write, the contents of EX:EAX are 
loaded into the TSC. 


The RDMSR and WRMSR instructions are privi- 
leged instructions. 


In addition, the TSC can be read using the RDTSC 
instruction, opcode OF 31h. The RDTSC instruction 
loads the contents of the TSC into EDX:EAX. The 
use of the RDTSC instruction is restricted by the 
TSC flag (bit 2) in the CR4 register (refer to Tables 
3-6 and 3-7 on pages 47 and 48 for CR4 register 
information). When the TSC bit = 0, the RDTSC 
instruction can be executed at any privilege level. 
When the TSC bit = 1, the RDTSC instruction can 
only be executed at privilege level 0. 
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3.4 Address Spaces 


The MediaGX processor can directly address 
either memory or I/O space. Figure 3-2 illustrates 
the range of addresses available for memory 
address space and I/O address space. For the 
CPU, the addresses for physical memory range 
between 0000 0000h and FFFF FFFFh 

(4 GBytes). The accessible I/O addresses space 
ranges between 0000 0000h and 0000 FFFFh 
(64KB). The CPU does not use coprocessor 
communication space in upper I/O space between 
800000F8h and 800000F Fh as do the 386-style 
CPUs. The I/O locations 22h and 23h are used for 
MediaGX processor configuration register access. 


3.4.1 I/O Address Space 


The CPU I/O address space is accessed using IN 
and OUT instructions to addresses referred to as 
“ports”? The accessible I/O address space is 64KB 
and can be accessed as 8-bit, 16-bit or 32-bit 
ports. 


The MediaGX processor configuration registers 
reside within the I/O address space at port 


addresses 22h and 23h and are accessed using 
the standard IN and OUT instructions. 


The configuration registers are modified by writing 
the index of the configuration register to port 22h, 
and then transferring the data through port 23h. 
Accesses to the on-chip configuration registers do 
not generate external I/O cycles. However, each 
operation on port 23h must be preceded by a write 
to port 22h with a valid index value. Otherwise, 
subsequent port 23h operations will communicate 
through the I/O port to produce external I/O cycles 
without modifying the on-chip configuration regis- 
ters. Write operations to port 22h outside of the 
CPU index range (COh-CFh and FEh-FFh) result in 
external I/O cycles and do not affect the on-chip 
configuration registers. Reading port 22h gener- 
ates external I/O cycles. 


I/O accesses to port address range 3BOh through 
3DFh can be trapped to SMI by the CPU if this 
option is enabled in the BC_XMAP_1 register (see 
SMIB, SMIC, and SMID bits in Table 4-9 on page 
113). Figure 3-2 illustrates the I/O address space. 


Accessible 
Physical Programmed 
Memory Space V/O Space 
FFFF FFFFh FFFF FFFFh 
Not 
Physical Memory Accessible 
4GB 
CPU General 
0000 FFFFh Configuration 
Register |/O 
64KB Space 
4 0000 0023h 
0000 0000h 0000 0000h 0000 0022h 
Figure 3-2 Memory and I/O Address Spaces 
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3.4.2 Memory Address Space 


The processor directly addresses up to 4GB of 
physical memory even though the memory 
controller addresses only 128MB of DRAM. Much 
of the other 4GB can be on PCI. Memory address 
space is accessed as bytes, words (16 bits) or 
DWORDs (82 bits). Words and DWORDs are 
stored in consecutive memory bytes with the low- 
order byte located in the lowest address. The phys- 
ical address of a word or DWORD is the byte 
address of the low-order byte. 


The processor allows memory to be addressed 
using nine different addressing modes. These 
addressing modes are used to calculate an offset 
address, often referred to as an effective address. 
Depending on the operating mode of the CPU, the 
offset is then combined, using memory manage- 
ment mechanisms, into a physical address that is 
applied to the physical memory devices. 


Memory management mechanisms consist of 
segmentation and paging. Segmentation allows 
each program to use several independent, 
protected address spaces. Paging translates a 
logical address into a physical address using trans- 
lation lookup tables. Virtual memory is often imple- 
mented using paging. Either or both of these 
mechanisms can be used for management of the 
MediaGX processor memory address space. 


3.5 Offset, Segment, and Paging 
Mechanisms 


The mapping of address space into a sequence of 
memory locations (often cached) is performed by 
the offset, segment and paging mechanisms. 


In general, the offset, segment and paging mecha- 
nisms work in tandem as shown below: 


instruction offset = offset mechanism => offset address 
offset address = segment mechanism => linear address 
linear address => paging mechanism => physical page. 


As will be explained, the actual operations depend 
on several factors such as the current operating 
mode and if paging is enabled. Note: the paging 
mechanism uses part of the linear address as an 
offset on the physical page. 
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3.6 Offset Mechanism 


In all operating modes, the offset mechanism 
computes an offset (effective) address by adding 
together up to three values: a base, an index anda 
displacement. The base, if present, is the value in 
one of eight general registers at the time of the 
execution of the instruction. The index, like the 
base, is a value that is contained in one of the 
general registers (except the ESP register) when 
the instruction is executed. The index differs from 
the base in that the index is first multiplied by a 
scale factor of 1, 2, 4 or 8 before the summation is 
made. The third component added to the memory 
address calculation is the displacement that is a 
value supplied as part of the instruction. Figure 3-3 
illustrates the calculation of the offset address. 


Nine valid combinations of the base, index, scale 
factor and displacement can be used with the CPU 
instruction set. These combinations are listed in 
Table 3-18. The base and index both refer to 
contents of a register as indicated by [Base] and 
[Index]. 


Table 3-18 Memory Addressing Modes 


Offset Mechanism 3 


In real mode operation, the CPU only addresses 
the lowest 1MB of memory and the offset contains 
16-bits. In protective mode the offset contains 32 
bits. Initialization and transition to protective mode 
is described in Section 3.13.4 “Initialization and 
Transition to Protected Mode” on page 99. 


Index 


Base Displacement 


Scaling 
x1, x2, x4, x8 


Offset Address 
(Effective Address) 


Figure 3-3 Offset Address Calculation 


Scale 
Factor Displacement Offset Address (OA) 
Addressing Mode Base | Index (SF) (DP) Calculation 

Direct X OA = DP 
Register Indirect Xx OA = [BASE] 
Based X X OA = [BASE] + DP 
Index X X OA = [INDEX] + DP 
Scaled Index Xx X X OA = ({INDEX] * SF) + DP 
Based Index X X OA = [BASE] + [INDEX] 
Based Scaled Index Xx X Xx OA = [BASE] + ({INDEX] * SF) 
Based Index with X Xx X OA = [BASE] + [INDEX] + DP 
Displacement 
Based Scaled Index X X Xx X OA = [BASE] + ({INDEX] * SF) + DP 
with Displacement 
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3.7 Descriptors and Segment 
Mechanisms 


Memory is divided into contiguous regions called 
“segments.” The segments allow the partitioning of 
individual elements of a program. Each segment 
provides a zero address-based private memory for 
such elements as code, data and stack space. 


The segment mechanisms select a segment in 
memory. Memory is divided into an arbitrary 
number of segments, each containing usually 
much less than the 2°° byte (4 GByte) maximum. 


There are two segment mechanisms, one for Real 
and Virtual 8086 Operating Modes, and one for 
Protective Mode. 


3.7.1 Real and Virtual 8086 Mode 


Segment Mechanisms 


Real Mode Segment Mechanism 


In real mode operation, the CPU addresses only 
the lowest 1MB of memory. In this mode a selector 


located in a one of the segment registers is used to 
locate a segment. 


To calculate a physical memory address, the 16-bit 
segment base address located in the selected 
segment register is multiplied by 16 and then a 16- 
bit offset address is added. The resulting 20-bit 
address is then extended with twelve zeros in the 
upper address bits to crate 32-bit physical address. 


The value of the selector (the INDEX field) is multi- 
plied by 16 to produce a base address (Figure 3-4.) 
The base address is summed with the instruction 
offset value to produce a physical address. 


Virtual 8086 Mode Segment Mechanism 

In Virtual 8086 mode the operation is performed as 
in real mode except that a paging mechanism is 
added. When paging is enabled, the paging 
mechanism translates the linear address into a 
physical address using cached look-up tables 
(refer to Section 3.9 “Paging Mechanism” on page 
80). 


Offset Address 16 


Offset Mechanism 


000h | 12 High Order Address Bits 


Linear Address 


Selected Segment Ne 20 


Register 


(Physical Address) 


Base Address 


Figure 3-4 Real Mode Address Calculation 
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3.7.2 Segment Mechanism in 


Protective Mode 


The segment mechanism in protective mode is 
more complex. Basically as in Real and Virtual 
8086 modes the offset address is added to the 
segment base address to produce a linear address 
(Figure 3-5). However, the calculation of the 
segment base address is based on the contents of 
descriptor tables. 


Again, if paging is enabled the linear address is 
further processed by the paging mechanism. 


A more detailed look at the segment mechanisms 
for real, virtual 8086 and protective modes is illus- 
trated in Figure 3-6. In protective mode, the 
segment selector is cached. This is illustrated in 
Figure 3-7 on page 71. 


3.7.2.1| Segment Selectors 


The segment registers are used to store segment 
selectors. In protective mode, the segment 


selectors are divided in to three fields: the RPL, TI 
and INDEX fields as shown in Figure 3-6. 


The segments are assigned permission levels to 
prevent application program errors from disrupting 
operating programs. The Requested Privilege Level 
(RPL) determines the Effective Privilege Level of an 
instruction. RPL = 0 indicates the most privileged 
level, and RPL = 3 indicates the least privileged level. 
Refer to Section 3.13 “Protection” on page 97. 


Descriptor tables hold descriptors that allow 
management of segments and tables in address 
space while in protective mode. The Table Indi- 
cator Bit (Tl) in the selector selects either the 
General Descriptor Table (GDT) or one Local 
Descriptor Tables (LDT) tables. If Tl = 0, GDT is 
selected; if Tl =1, LDT is selected. The 13-bit 
INDEX field in the segment selector is used to 
index a GDT or LDT table. 


32 Offset Address 


Offset Mechanism 


Segment Base 
32 Address 


Selector Mechanism 


Linear : 
32 Address Optional 32 
Paging Mechanism 


Physical 
Memory 
Address 


Figure 3-5 Protected Mode Address Calculation 
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< Logical Address > 
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INDEX INSTRUCTION OFFSET 
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Address ae 


|____plx 16 > pI >| 
Base Linear LP Physical Segment 


Address Address Address 
p= Paging Mechanism for Virtual 8086 Mode only 


Main Memory 


Real and Virtual 8086 Modes 


< Logical Address > 
<¢—— Segment Selector —————————_» 


15 321 0 
INDEX Tl |RPL INSTRUCTION OFFSET 


-— 


Segment Descriptor > >| 
= mee E Base *L* Linear? Physical Segment 
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GDT or LDT Descriptor Table p = Paging Mechanism Main Memory 


Protective Mode 


Figure 3-6 Selector Mechanisms 


Page 70 Cyrix Corporation Confidential GXm_db_v2.0 


Descriptors and Segment Mechanisms 3 


Selector Load Instruction 


Segment Register 
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Figure 3-7 Selector Mechanism Caching 
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3.7.3 GDTR and LDTR Registers 


The GDT, and LDT descriptor tables are defined by 
the Global Descriptor Table Register (GDTR) and 
the Local Descriptor Table Register (_DTR) respec- 
tively. Some texts refer to these registers as GDT, 
and LDT descriptors. 


The following instructions are used in conjunction 
with the GDTR and LDTR registers: 


« LGDT - Load memory to GDTR 
« LLDT - Load memory to LDTR 
* SGDT - Store GDTR to memory 
« SLDT - Store LDTR to memory 


The GDTR is set up in REAL mode using the 
LGDT instruction. This is possible as the LGDT 
instructions are one of two instructions that directly 
load a linear address (instead of a segment relative 
address) in protective mode. (The other instruction 
is the Load Interrupt Descriptor Table [LIDT]). 


As shown in Table 3-19, the GDTR registers 
contain a BASE ADDRESS field and a LIMIT field 
to that define the GDT tables. (The IDTR register is 
described in Section 3.7.3.2 “Task, Gate and Inter- 
rupt Descriptors” on page 73.) 


Also shown in Table 3-19, the LDTR is only two 
bytes wide as it contains only a SELECTOR field. 


Table 3-19 GDTR, LDTR and IDTR Registers 


Descriptors and Segment Mechanisms 


The contents of the SELECTOR field points to a 
descriptor in the GDT table. 


3.7.3.1 Segment Descriptors 

There are several types of descriptors. A segment 
descriptor defines the base address, limit and 
attributes of a memory segment. 


The GDT or LDT table can hold several types of 
descriptors. In particular, the segment descriptors 
are stored in either of two registers, the GDT, or the 
LDT as shown in Table 3-19). Either of these tables 
can store as many as 8,192 (2'3) eight-byte selec- 
tors taking as much as 64KB of memory. 


The first descriptor in the GDT (location 0) is not 
used by the CPU and is referred to as the “null 
descriptor.” 


Types of Segment Descriptors 

The type of memory segments are defined as 
defined by corresponding types of segment 
descriptors: 


* Code Segment Descriptors 
* Data Segment Descriptors 
* Stack Segment Descriptors 
« LDT Segment Descriptors 
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3.7.3.2 Task, Gate and Interrupt 
Descriptors 


Besides segment descriptors there are descriptors 
used in task switching, switching between tasks 
with different priority and those used to control 
interrupt functions: 


* Task State Segment Table Descriptors 
* Gate Table Descriptors 
¢ Interrupt Descriptors. 


All descriptors some things in common. They are 
all eight bytes in length and have three fields in 
(BASE, LIMIT and TYPE). The BASE field defines 
the starting location for the table or segment. The 
LIMIT field defines the size and the TYPE field 
depends on the type of descriptor. One of the main 
functions of the TYPE field is to define the access 
rights to the associated segment or table. 


Interrupt Descriptor Table 


The Interrupt Descriptor Table is an array of 256 8- 
byte (4-byte for real mode) interrupt descriptors, 
each of which is used to point to an interrupt 
service routine. Every interrupt that may occur in 
the system must have an associated entry in the 
IDT. The contents of the IDTR are completely 


visible to the programmer through the use of the 
SIDT instruction. 


The IDT descriptor table is defined by the Interrupt 
Descriptor Table Register (IDTR). Some texts refer 
to this register as an IDT descriptor. 


The following instructions are used in conjunction 
with the IDTR registers: 


¢ LIDT - Load memory to IDTR 
* SIDT - Store IDTR to memory 


The IDTR is set up in REAL mode using the LIDT 
instruction. This is possible as the LIDT instruc- 
tions is only one of two instructions that directly 
load a linear address (instead of a segment relative 
address) in protective mode. 


As previously shown in Table 3-19, the IDTR 
register contains a BASE ADDRESS field and a 
LIMIT field that define the IDT tables. 


3.7.4 Descriptor Bit Structure 


The bit structure for application and system 
descriptors is shown in Table 3-20. The explana- 
tion of the TYPE field is shown in Table 3-22. 


Table 3-20 Application and System Segment Descriptors 


Memory Offset +4 


31 | 31) 29 | 28 | 27 | 26 | 25 | 24 | 23 | 22 | 21 | 20/19/18) 17) 16) 15) 14/13/12/11)/10) 9 | 8 


7;6/5/4)/3);2)1/0 


BASE[31:24] G|D/0/]A | LIMIT[19:16] | P| DPL |S TYPE BASE[23:16] 
Vv 
L 
Memory Offset +0 
BASE[15:0] LIMIT[15:0] 
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Table 3-21 Application and System Segment Descriptors Bit Definitions 


Memory 
Bit Offset Name _ | Description 
31:24 +4 BASE _ | Segment Base Address: Three fields which collectively define the base location for the segment 
7:0 44 in 4GB physical address space. 
31:16 +0 
19:16 +4 LIMIT | Segment Limit: Two fields that define the size of the segment based on the Segment Limit 
15:0 +0 Granularity Bit. 
If G = 1: Limit value interpreted in units of 4KB. 
If G = 0: Limit value is interpreted in bytes. 
23 +4 G Segment Limit Granularity Bit: Defines LIMIT multiplier. 
If G = 1: Limit value interpreted in units of 4KB. Segment size ranges from 1 byte to 1MB. 
If G = 0: Limit value is interpreted in bytes. Segment size ranges from 4KB to 4GB. 
22 +4 D Default Length for Operands and Effective Addresses: 
If D = 1: Code segment = 32-bit length for operands and effective addresses 
If D = 0: Code segment = 16-bit length for operands and effective addresses 
If D = 1: Data segment = Pushes, calls and pop instructions use 32-bit ESP register 
If D = 0: Data segment = Stack operations use 16-bit SP register 
20 +4 AVL Segment Available: This field is available for use by system software. 
15 +4 P Segment Present: 
If = 1: Segment is memory segment allocated. 
If = 0: The BASE and LIMIT fields become available for use by the system. Also, If = 0, a segment- 
not-present exception generated when selector for the descriptor is loaded into a segment register 
allowing virtual memory management. 
14:13 +4 DPL Descriptor Privilege Level: 
If = 00: Highest privilege level 
If = 11: Low privilege level 
12 +4 iS) Descriptor Type: 
If = 1: Code or data segment 
If = 0: System segment 
11:8 +4 TYPE | Segment Type - Refer to Table 3-22 for TYPE bit definitions. 
Bit 11 = Executable 
Bit 10 = Conforming if bit 12 = 1 
Bit 10 = Expand Down if bit 12 = 0 
Bit 9 = Readable, if Bit 12 = 1 
Bit 9 = Writable, if Bit 12 =0 
Bit 8 = Accessed 
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Table 3-22 Application and System Segment Descriptors TYPE Bit Definitions 


TYPE System Segment and Gate Types Application Segment Types 
Bits [11:8] Bit 12 =0 Bit 12 =1 
Num SEWA TYPE (Data Segments) 

0 0000 Reserved Data Read-Only 

1 0001 Available 16-Bit TSS Data Read-Only, accessed 

2 0010 LDT Data Read/Write 

3 0011 Busy 16-Bit TSS Data Read/Write accessed 

4 0100 16-Bit Call Gate Data Read-Only, expand down 

5 0101 Task Gate Data Read-Only, expand down, accessed 

6 0110 16-Bit Interrupt Gate Data Read/Write, expand down 

7 0111 16-Bit Trap Gate Data Read/Write, expand down, accessed 

Num SCRA TYPE (Code Segments) 

8 1000 Reserved Code Execute-Only 

9 1001 Available 32-Bit TSS Code Execute-Only, accessed 

A 1010 Reserved Code Execute/Read 

B 1011 Busy 32-Bit TSS Code Execute/Read, accessed 

Cc 1100 32-Bit Call Gate Code Execute/Read, conforming 

D 1101 Reserved Code Execute/Read, conforming, accessed 

E 1110 32-Bit Interrupt Gate Code Execute/Read-Only, conforming 

F 1111 32-Bit Trap Gate Code Execute/Read-Only, conforming accessed 
S = Code Segment (not Data Segment) A = Accessed 
E = Expand Down C = Conforming Code Segment 
W = Write Enable R = Read Enable 
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3.7.5 Gate Descriptors 


Four kinds of gate descriptors are used to provide 
protection during control transfers: call gates, trap 
gates, interrupt gates and task gates. (For more 
information on protection refer to Section 3.13 
“Protection” on page 97.) 


Call Gate Descriptor (CGD). Call gates are used 
to define legal entry points to a procedure with a 
higher privilege level. The call gates are used by 
CALL and JUMP instructions in much the same 
manner as code segment descriptors. When the 
CPU decodes an instruction and sees it refers toa 
call gate descriptor in the GDT table or a LDT 
table, the call gate is used to point to another 
descriptor in the table that defines the destination 
code segment. 


The following privilege levels are tested during the 
transfer through the call gate: 
¢ CPL = Current Privilege Level 


Table 3-23 Gate Descriptors 


xX Descriptors and Segment Mechanisms 


« RPL = Segment Selector Field 

¢ DPL = Descriptor Privilege Level in the call gate 
descriptor. 

¢ DPL = Descriptor Privilege Level in the destina- 
tion code segment. 


The maximum value of the CPL and RPL must be 
equal or less than the gate DPL. For a JMP 
instruction the destination DPL equals the CPL. 
For a CALL instruction the destination DPL is less 
or equals the CPL. 


Conforming Code Segments. Transfer to a 
procedure with a higher privilege level can also be 
accomplished by bypassing the use of call gates, if 
the requested procedure is to be executed ina 
conforming code segment. Conforming code 
segments have the C bit set in the TYPE field in 
their descriptor. 


The bit structure and definitions for gate descrip- 
tors are shown in Tables 3-23 and 3-24. 


Table 3-24 Gate Descriptors Bit Definitions 


31 | 30 | 29 | 28 | 27 | 26 | 25 | 24 | 23 | 22 | 21 | 20/19] 18) 17| 16) 15}14/13/12/11)10;9|/8)/7/6/5)4})/3/;2/1)0 
Memory Offset +4 

OFFSET[31:16] P| DPL | 0 TYPE 0|0{}{0] PARAMETERS 
Memory Offset +0 

SELECTOR[15:0] OFFSET[15:0] 


Memory 
Bit Offset Name Description 
31:16 +4 OFFSET Offset: Offset used during a call gate to calculate the branch target. 
15:0 +0 
31:16 +0 SELECTOR | Segment Selector 
15 +4 P Segment Present 
14:13 +4 DPL Descriptor Privilege Level 
11:8 +4 TYPE Segment Type: 
0100 = 16-bit call gate 1100 = 32-bit call gate 
0101 = Task gate 1110 = 32-bit interrupt gate 
0110 = 16-bit interrupt gate 1111 = 32-bit trap gate 
0111 = 16-bit trap gate 
4:0 +4 PARAMETERS | Parameters: Number of parameters to copy from the caller’s stack to the called proce- 
dure’s stack. 
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3.8 Multitasking and Task State 
Segments 


The CPU enables rapid task switching using JMP 
and CALL instructions that refer to Task State 
Segments (TSS). During a switch, the complete 
task state of the current task is stored in its TSS, 
and the task state of the requested task is loaded 
from its TSS. The TSSs are defined through 
special segment descriptors and gates. 


The Task Register (TR) holds 16-bit descriptors 
that contain the base address and segment limit for 
each task state segment. The TR is loaded and 
stored via the LTR and STR instructions, respec- 
tively. The TR can only be accessed only during 
protected mode and can be loaded when the privi- 
lege level is 0 (most privileged). When the TR is 
loaded, the TR selector field indexes a TSS 
descriptor that must reside in the Global Descriptor 
Table (GDT). 


Only the 16-bit selector of a TSS descriptor in the 
TR is accessible. The BASE, TSS LIMT and 
ACCESS RIGHT fields are program invisible. 


During task switching, the processor saves the 
current CPU state in the TSS before starting a new 
task. The TSS can be either a 386/486-type 32-bit 
TSS (see Table 3-25) or a 286-type 16-bit TSS (see 
Table 3-26). 


Task Gate Descriptors. A task gate descriptor 
provides controlled access to the descriptor for a 
task switch. The DPL of the task gate is used to 
control access. The selector’s RPL and the CPL of 
the procedure must be a higher level (numerically 
less) than the DPL of the descriptor. The RPL in 
the task gate is not used. 


The I/O Map Base Address field in the 32-bit TSS 
points to an I/O permission bit map that often 
follows the TSS at location +68h. 
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Table 3-25 32-Bit Task State Segment (TSS) Table 


31 16/15 0 
\/O Map Base Address 0;0/0/0/0)/0}0;/0)/0)/0}0;0;0/0]0;T] +64h 
0/;0/0;0;0;0;0;0)/0;0};0}/0}0;0};0)]0 Selector for Task’s LDT +60h 
0/0;/0;0;0/0}0/0}0/0)0/0]/0}0])0}]0 GS +5Ch 
0/0;/0;0;0/0}0;/0}/0/0;0/0/0;}0])0]0 FS +58h 
0/0;/0;0;/0/0};0/0}0/0}0/0/0}0])0}]0 DS +54h 
0/0;/0;0;/0/0}0/0}/0/0}0/0/0;}0)0]0 Ss +50h 
0/0;/0;0;/0/0}0/0}/0/0}0/0]/0}0])0]0 cs +4Ch 
0/0;/0;0;0/0}0/0}0/0}0/0]/0;}0])0}]0 ES +48h 
EDI +44h 
ESI +40h 
EBP +3Ch 
ESP +38h 
EBX +34h 
EDX +30h 
ECX +2Ch 
EAX +28h 
EFLAGS +24h 
EIP +20h 
CR3 +1Ch 
0/0/;0;0;0/0}0/0}/0/0}0/0]0}0])0]0 SS for CPL = 2 +18h 
ESP for CPL = 2 +14h 
0/0;/0;0;/0/0}/0/0}/0/0}0/0/0}0])0]0 SS for CPL = 1 +10h 
ESP for CPL = 1 +Ch 
0/0;/0;0;0/0}/0/0}/0/0}0/0/0}0)0}]0 SS for CPL = 0 +8h 
ESP for CPL = 0 +4h 
0/0;/0;0;0/0}0/0}/0/0);0/0]/0}0])0]0 Back Link (Old TSS Selector) +0h 


Note: 0 = Reserved 
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Table 3-26 16-Bit Task State Segment (TSS) Table 


15 0 
Selector for Task’s LDT +2Ah 
DS +28h 
Ss +26h 
CS +24h 
ES +22h 
DI +20h 
Sl +1Eh 
BP +1Ch 
SP +1Ah 
BX +18h 
DX +16h 
CX +14h 
AX +12h 
FLAGS +10h 
IP +Eh 
SS for Privilege Level 0 +Ch 
SP for Privilege Level 1 +Ah 
SS for Privilege Level 1 +8h 
SP for Privilege Level 1 +6h 
SS for Privilege Level 0 +4h 
SP for Privilege Level 0 +2h 
Back Link (Old TSS Selector) +0h 
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3.9 Paging Mechanism 


The paging mechanism either translates a linear 
address to its corresponding physical address. If 
the required page is not currently present in RAM, 
an exception is generated. When the operating 
system services the exception, the required page 
can be loaded into memory and the instruction 
restarted. Pages are either 4KB or 1MB in size. 
The CPU defaults to 4KB pages that are aligned to 
4KB boundaries. 


A page is addressed by using two levels of tables 
as illustrated in Figure 3-8. Bits[31:22] of the 32-bit 
linear address, the Directory Table Index (DTI) are 


used to locate an entry in the page directory table. 
The page directory table acts as a 32-bit master 
index to up to 1K individual second-level page 
tables. The selected entry in the page directory 
table, referred to as the directory table entry (DTE), 
identifies the starting address of the second-level 
page table. The page directory table itself is a page 
and is, therefore, aligned to a 4KB boundary. The 
physical address of the current page directory table 
is stored in the CR3 control register, also referred 
to as the Page Directory Base Register (PDBR). 


Linear 


Address 
31 22 21 


DTE Cache 
2-Entry 
Fully Associative 


orat—+[_ do 


Control Directory Table 
Register 


Directory Table Index Page Table Index Page Frame Offset 
(DTI) (PTI) (PFO) 


12 11 0 


4GB 


-4KB 
Physical Page 
| 


Main TLB 
32-Entry 
4-Way Set 
Associative 


Page Table Memory 


External Memory 


Figure 3-8 Paging Mechanism 
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Bits [21:12] of the 32-bit linear address, referred to 
as the Page Table Index (PTI), locate a 32-bit entry 
in the second-level page table. This Page Table 
Entry (PTE) contains the base address of the 
desired page frame. The second-level page table 
addresses up to 1K individual page frames. A 
second-level page table is 4KB in size and is itself 
a page. Bits [11:0] of the 32-bit linear address, the 
Page Frame Offset (PFO), locate the desired phys- 
ical data within the page frame. 


Since the page directory table can point to 1K page 
tables, and each page table can point to 1K page 
frames, a total of 1M page frames can be imple- 
mented. Since each page frame contains 4KB, up 
to 4GB of virtual memory can be addressed by the 
CPU with a single page directory table. 


Along with the base address of the page table or 
the page frame, each directory table entry or page 
table entry contains attribute bits and a present bit 
as illustrated in Table 3-27. 


If the present bit (P) is set in the DTE, the page 
table is present and the appropriate page table 
entry is read. If P = 1 in the corresponding PTE 
(indicating that the page is in memory), the 
accessed and dirty bits are updated, if necessary, 
and the operand is fetched. Both accessed bits are 
set (DTE and PTE), if necessary, to indicate that 
the table and the page have been used to translate 
a linear address. The dirty bit (D) is set before the first 
write is made to a page. 


The present bits must be set to validate the 
remaining bits in the DTE and PTE. If either of the 
present bits are not set, a page fault is generated 
when the DTE or PTE is accessed. If P = 0, the 
remaining DTE/PTE bits are available for use by 
the operating system. For example, the operating 
system can use these bits to record where on the 
hard disk the pages are located. A page fault is 
also generated if the memory reference violates 
the page protection attributes. 


Table 3-27 Directory Table Entry (DTE) and Page Table Entry (PTE) 


Bit Name Description 
31:12 BASE Base Address: Specifies the base address of the page or page table. 
ADDRESS 
11:9 AVAILABLE | Available: Undefined and Available to the Programmer 
8:7 RSVD Reserved: Unavailable to programmer 
6 D Dirty Bit: 
PTE format — If = 1: Indicates that a write access has occurred to the page. 
DTE format — Reserved. 
5 A Accessed Flag: If set, indicates that a read access or write access has occurred to the page. 
4:3 RSVD Reserved: Set to 0. 
2 U/S User/Supervisor Attribute: 
If = 1: Page is accessible by User at privilege level 3. 
If = 0: Page is accessible by Supervisor only when CPL < 2. 
1 W/R Write/Read Attribute: 
If = 1: Page is writable. 
If = 0: Page is read only. 
0 P Present Flag: 
If = 1: The page is present in RAM and the remaining DTE/PTE bits are validated 
If = 0: The page is not present in RAM and the remaining DTE/PTE bits are available for use by the 
programmer. 
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Translation Look-Aside Buffer 

The translation look-aside buffer (TLB) is a cache 
for the paging mechanism and replaces the two- 
level page table lookup procedure for TLB hits. The 
TLB is a four-way set associative 32-entry page 
table cache that automatically keeps the most 
commonly used page table entries in the 
processor. The 32-entry TLB, coupled with a 4K 
page size, results in coverage of 128KB of memory 
addresses. 


The TLB must be flushed when entries in the page 
tables are changed. The TLB is flushed whenever 
the CR8 register is loaded. An individual entry in 
the TLB can be flushed using the INVLPG instruc- 
tion. 


DTE Cache 

The DTE cache caches the two most recent DTEs 
so that future TLB misses only require a single 
page table read to calculate the physical address. 
The DTE cache is disabled following reset and can 
be enabled by setting the DTE_EN bit in CCR4[4] 
(Index E8h). 


3.10 Interrupts and Exceptions 


The processing of either an interrupt or an excep- 
tion changes the normal sequential flow of a 
program by transferring program control to a 
selected service routine. Except for SMM inter- 
rupts, the location of the selected service routine is 
determined by one of the interrupt vectors stored in 
the interrupt descriptor table. 


True interrupts are hardware interrupts and are 
generated by signal sources external to the CPU. 
All exceptions (including so-called software interrupts) 
are produced internally by the CPU. 


The INTR interrupt is unmasked when the Inter- 
rupt Enable Flag (IF, bit 9) in the EFLAGS register 
is set to 1. Except for string operations, INTR inter- 
rupts are acknowledged between instructions. 
Long string operations have interrupt windows 


3.10.1. Interrupts 


External events can interrupt normal program 
execution by using one of the three interrupt pins 
on the MediaGX processor: 


¢ Non-maskable Interrupt (NMI pin) 
¢ Maskable Interrupt (INTR pin) 
¢ SMM Interrupt (SMI# pin) 


For most interrupts, program transfer to the inter- 
rupt routine occurs after the current instruction has 
been completed. When the execution returns to the 
original program, it begins immediately following 
the interrupted instruction. 


The NMI interrupt cannot be masked by software 
and always uses interrupt vector 2 to locate its 
service routine. Since the interrupt vector is fixed 
and is supplied internally, no interrupt acknowledge 
bus cycles are performed. This interrupt is normally 
reserved for unusual situations such as parity 
errors and has priority over INTR interrupts. 


Once NMI processing has started, no additional 
NMls are processed until an IRET instruction is 
executed, typically at the end of the NMI service 
routine. If NMI is re-asserted before execution of 
the IRET instruction, one and only one NMI rising 
edge is stored and then processed after execution 
of the next IRET. 


During the NMI service routine, maskable inter- 
rupts may be enabled. If an unmasked INTR 
occurs during the NMI service routine, the INTR is 
serviced and execution returns to the NMI service 
routine following the next IRET. If a HALT instruc- 
tion is executed within the NMI service routine, the 
CPU restarts execution only in response to 
RESET, an unmasked INTR or a System Manage- 
ment Mode (SMM) interrupt. NMI does not restart 
CPU execution under this condition. 


between memory moves that allow INTR interrupts 
to be acknowledged. 


When an INTR interrupt occurs, the CPU performs 
an interrupt-acknowledge bus cycle. During this 
cycle, the CPU reads an 8-bit vector that is 
supplied by an external interrupt controller. This 
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vector selects which of the 256 possible interrupt 
handlers will be executed in response to the inter- 
rupt. 


The SMM interrupt has higher priority than either 
INTR or NMI. After SMI# is asserted, program 
execution is passed to an SMI service routine that 
runs in SMM address space reserved for this 
purpose. The remainder of this section does not 
apply to the SMM interrupts. SMM interrupts are 
described in greater detail later in this section. 


3.10.2 Exceptions 


Exceptions are generated by an interrupt instruc- 
tion or a program error. Exceptions are classified 
as traps, faults or aborts depending on the mecha- 
nism used to report them and the restartability of 
the instruction which first caused the exception. 


A Trap exception is reported immediately 
following the instruction that generated the trap 
exception. Trap exceptions are generated by 
execution of a software interrupt instruction (INTO, 
INT3, INTn, BOUND), by a single-step operation or 
by a data breakpoint. 


Software interrupts can be used to simulate hard- 
ware interrupts. For example, an INTn instruction 
causes the processor to execute the interrupt 
service routine pointed to by the nth vector in the 
interrupt table. Execution of the interrupt service 
routine occurs regardless of the state of the IF flag 
(bit 9) in the EFLAGS register. 


The one byte INT3, or breakpoint interrupt (vector 
3), is a particular case of the INTn instruction. By 
inserting this one byte instruction in a program, the 
user can set breakpoints in the code that can be 
used during debug. 


Single-step operation is enabled by setting the TF 
bit (bit 8) in the EFLAGS register. When TF is set, 
the CPU generates a debug exception (vector 1) 
after the execution of every instruction. Data break- 
points also generate a debug exception and are 
specified by loading the debug registers (DRO- 
DR7) with the appropriate values. 


A Fault exception is reported before completion of 
the instruction that generated the exception. By 
reporting the fault before instruction completion, 
the CPU is left in a state that allows the instruction 
to be restarted and the effects of the faulting 
instruction to be nullified. Fault exceptions include 
divide-by-zero errors, invalid opcodes, page faults 
and coprocessor errors. Debug exceptions (vector 
1) are also handled as faults (except for data 
breakpoints and single-step operations). After 
execution of the fault service routine, the instruc- 
tion pointer points to the instruction that caused the 
fault. 


An Abort exception is a type of fault exception 
that is severe enough that the CPU cannot restart 
the program at the faulting instruction. The double 
fault (vector 8) is the only abort exception that 
occurs on the CPU. 


3.10.3 Interrupt Vectors 


When the CPU services an interrupt or exception, 
the current program’s instruction pointer and flags 
are pushed onto the stack to allow resumption of 
execution of the interrupted program. In protected 
mode, the processor also saves an error code for 
some exceptions. Program control is then trans- 
ferred to the interrupt handler (also called the inter- 
rupt service routine). Upon execution of an IRET at 
the end of the service routine, program execution 
resumes at the instruction pointer address saved 
on the stack when the interrupt was serviced. 


3.10.3.1 Interrupt Vector Assignments 


Each interrupt (except SMI#) and exception is 
assigned one of 256 interrupt vector numbers as 
shown in Table 3-28. The first 32 interrupt vector 
assignments are defined or reserved. INT instruc- 
tions acting as software interrupts may use any of 
interrupt vectors, 0 through 255. 


The non-maskable hardware interrupt (NMI) is 
assigned vector 2. Illegal opcodes including faulty 
FPU instructions will cause an illegal opcode 
exception, interrupt vector 6. NMI interrupts are 
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enabled by setting bit 2 of the CCR7 register (Index 
EBh[2] = 1, see Table 3-11 on page 54 for register 
format). 


In response to a maskable hardware interrupt 
(INTR), the CPU issues interrupt acknowledge bus 
cycles used to read the vector number from external 
hardware. These vectors should be in the range 32 
to 255 as vectors 0 to 31 are predefined. In PCs, 
vectors 8 through 15 are used. 


3.10.3.2 Interrupt Descriptor Table 


The interrupt vector number is used by the CPU to 
locate an entry in the interrupt descriptor table 
(IDT). In real mode, each IDT entry consists of a 
four-byte far pointer to the beginning of the corre- 
sponding interrupt service routine. In protected 
mode, each IDT entry is an 8-byte descriptor. The 
Interrupt Descriptor Table Register (IDTR) speci- 
fies the beginning address and limit of the IDT. 
Following reset, the IDTR contains a base address 
of Oh with a limit of 3FFh. 


The IDT can be located anywhere in physical 
memory as determined by the IDTR register. The 
IDT may contain different types of descriptors: 
interrupt gates, trap gates and task gates. Interrupt 
gates are used primarily to enter a hardware inter- 
rupt handler. Trap gates are generally used to 
enter an exception handler or software interrupt 
handler. If an interrupt gate is used, the Interrupt 
Enable Flag (IF) in the EFLAGS register is cleared 
before the interrupt handler is entered. Task gates 
are used to make the transition to a new task. 


Table 3-28 Interrupt Vector Assignments 


Table 3-28 Interrupt Vector Assignments 


Interrupt Exception 
Vector Function Type 

0 Divide error Fault 

1 Debug exception Trap/Fault* 
2 NMI interrupt 

3 Breakpoint Trap 

4 Interrupt on overflow Trap 

5 BOUND range exceeded Fault 

6 Invalid opcode Fault 


Interrupt Exception 
Vector Function Type 

7 Device not available Fault 

8 Double fault Abort 

9 Reserved 
10 Invalid TSS Fault 
11 Segment not present Fault 
12 Stack fault Fault 
13 General protection fault Trap/Fault 
14 Page fault Fault 
15 Reserved 
16 FPU error Fault 
17 Alignment check exception Fault 
18:31 Reserved 
32:55 Maskable hardware interrupts Trap 
0:255 Programmed interrupt Trap 
Note: *Data breakpoints and single steps are traps. All 


other debug exceptions are faults. 
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3.10.4 Interrupt and Exception 
Priorities 

As the CPU executes instructions, it follows a 
consistent policy for prioritizing exceptions and 
hardware interrupts. The priorities for competing 
interrupts and exceptions are listed in Table 3-29. 
SMM interrupts always take precedence. Debug 
traps for the previous instruction and next instruc- 
tions are handled as the next priority. When NMI 
and maskable INTR interrupts are both detected at 
the same instruction boundary, the MediaGX 
processor services the NMI interrupt first. 


The CPU checks for exceptions in parallel with 
instruction decoding and execution. Several excep- 


Table 3-29 Interrupt and Exception Priorities 


tions can result from a single instruction. However, 
only one exception is generated upon each attempt 
to execute the instruction. Each exception service 
routine should make the appropriate corrections to 
the instruction and then restart the instruction. In 
this way, exceptions can be serviced until the 
instruction executes properly. 


The CPU supports instruction restart after all faults, 
except when an instruction causes a task switch to 
a task whose task state segment (TSS) is partially 
not present. A TSS can be partially not present if 

the TSS is not page aligned and one of the pages 
where the TSS resides is not currently in memory. 


Priority Description Notes 

0 Warm Reset. Caused by the assertion of WM_RST. 

1 SMM hardware interrupt. SMM interrupts are caused by SMI# asserted and always have 
highest priority. 

2 Debug traps and faults from previous instruction. | Includes single-step trap and data breakpoints specified in the 
debug registers. 

3 Debug traps for next instruction. Includes instruction execution breakpoints specified in the debug 
registers. 

Non-maskable hardware interrupt. Caused by NMI asserted. 
5 Maskable hardware interrupt. Caused by INTR asserted and IF = 1. 


Faults resulting from fetching the next instruction. 


Includes segment not present, general protection fault and page 
fault. 


Faults resulting from instruction decoding. 


Includes illegal opcode, instruction too long, or privilege violation. 


Device not available exception generated. 


7 
8 WAIT instruction and TS = 1 and MP = 1. 
9 ESC instruction and EM = 1 or TS = 1. 


Device not available exception generated. 


Caused by unmasked floating point exception with NE = 1. 


10 Floating point error exception. 
1 


Segmentation faults (for each memory reference 
required by the instruction) that prevent transfer- 
ring the entire memory operand. 


Includes segment not present, stack fault, and general protection 
fault. 


12 Page Faults that prevent transferring the entire 
memory operand. 


13 Alignment check fault. 
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3.10.5 Exceptions in Real Mode 


Many of the exceptions described in Table 3-28 
"Interrupt Vector Assignments" on page 84 are not 
applicable in real mode. Exceptions 10, 11, and 14 
do not occur in real mode. Other exceptions have 
slightly different meanings in real mode as listed in 


Table 3-30. 


Table 3-30 Exception Changes in Real Mode 


Vector Protected Mode Real Mode 
Number Function Function 
8 Double fault. Interrupt table limit overrun. 
10 Invalid TSS. Does not occur. 
11 Segment not Does not occur. 
present. 
12 Stack fault. SS segment limit overrun. 
13 General protection | CS, DS, ES, FS, GS seg- 
fault. ment limit overrun. In pro- 
tected mode, an error is 
pushed. In real mode, no 
error is pushed. 
14 Page fault. Does not occur. 


Table 3-31 Error Codes 


3.10.6 Error Codes 


When operating in protected mode, the following 
exceptions generate a 16-bit error code: 


* Double Fault 

« Alignment Check 

¢ Invalid TSS 

* Segment Not Present 

¢ Stack Fault 

¢ General Protection Fault 
¢ Page Fault 


The error code format and bit definitions are shown 
in Table 3-31. Bits [15:3] (selector index) are not 
meaningful if the error code was generated as the 
result of a page fault. The error code is always zero 
for double faults and alignment check exceptions. 


15 14 13 12 


11 10 9 8 


Selector Index 


Table 3-32 Error Code Bit Definitions 


$2 $1 SO 


Fault Selector Index 
Type (Bits 15:3) S2 (Bit 2) $1 (Bit 1) SO (Bit 0) 
Page Reserved. Fault caused by: Fault occurred during: Fault occurred during 
Fault 0 = Not present page 0 = Read access 0 = Supervisor access 
1 = Page-level protection 1 = Write access 1 = User access. 
violation 
IDT Index of faulty IDT | Reserved If = 1, exception occurred while 
Fault selector. trying to invoke exception or 
hardware interrupt handler. 
Segment | Index of faulty TI bit of faulty selector If =1, exception occurred while 
Fault selector. trying to invoke exception or 
hardware interrupt handler. 
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3.11. System Management Mode 


System Management Mode (SMM) is usually 
employed for system power management or soft- 
ware-transparent emulation of I/O peripherals. 
SMM mode is entered through a hardware signal 
“System Management Interrupt” (SMI# pin) that 
has a higher priority than any other interrupt, 
including NMI. An SMM interrupt can also be trig- 
gered from software using an SMINT instruction. 
Following an SMM interrupt, portions of the CPU 
state are automatically saved, SMM mode is 


entered, and program execution begins at the base 
of SMM address space (Figure 3-9). 


The MediaGX processor extends System Manage- 
ment Mode (SMM) to support the virtualization of 
many devices, including VGA video. The SMM 
mechanism can be triggered not only by I/O 
activity, but by access to selected memory regions. 
For example, SMM interrupts are generated when 
VGA addresses are accessed. As well be 
described, other SMM enhancements have 
reduced SMM overhead and improved virtualiza- 
tion-software performance. 


Physical 
Memory Space 


FFFF FFFFh 


Physical Memory 
4GB 


0000 0000h 
Non-SMM 


Potential 
SMM Address 
Space 
FFFF FFFFh 


Defined 
SMM 
4KB to 32MB Address 
Space 
0000 0000h 


SMM 


Figure 3-9 System Management Memory Address Space 
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3.11.1 SMM Enhancements Table 3-33 SMI# and SMINT Recognition 
Eight SMM instructions have been added to the Requirements 

x86 instruction set that permit initiating SMM Register Bits SMl# SMINT 
through software and saving and restoring the total USE_SMI, CCRI[1] (Index Cth) 1 1 
CPU state when in SMM. SMAC, CCRI1[2] (Index C1h) 0 1 
The SMM header now: SIZE[3:0], SMAR3[3:0] (Index CFh) >0 >0 


* Stores 32-bits memory addresses. 


¢ Stores 32-bit memory data. 
¢ Differentiates memory and I/O accesses. 


* Indicates if an SMM interrupt was generated by 
access to a VGA region. 


The SMM service code is now cacheable. An 
SMAR register specifies the SMM region code 


SMI# Sampled Active or 
SMINT Instruction Executed 
CPU State Stored in SMM 
Address Space Header 
base and limit. An SMHR register specifies the 
physical address for the SMM header. The 
SMI_NEST bit enables the nesting of SMM inter- 
rupt Program Flow Transfers 
pts. 
to SMM Address Space 


3.11.2 SMM Operation 


SMM execution flow is summarized in Figure 3-10. 
Entering SMM requires the assertion of the SMl# 

pin for at least two SYSCLK periods or execution of 
the SMINT instruction. For the SMI# signal or 


SMINT instruction to be recognized, configuration 
register bits must be set as shown in Table 3-33. ee sn as ode 
(The configuration registers are discussed in detail Dee aera rere 
in Section 3.3.2.2 “Configuration Registers” on 
page 50.) 
RSM Instruction Restores CPU 
State Using Header Information 
Normal Execution Resumes 


Figure 3-10 SMM Execution Flow 
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After triggering an SMM through the SMI# pin or a 
SMINT instruction, selected CPU state information 
is automatically saved in the SMM memory space 
header located at the top of SMM memory space. 
After saving the header, the CPU enters real mode 
and begins executing the SMM service routine 

starting at the SMM memory region base address. 


The SMM service routine is user definable and 
may contain system or power management soft- 
ware. If the power management software forces 
the CPU to power down or if the SMM service 
routine modifies more registers than are automati- 
cally saved, the complete CPU state information 
should be saved. 


3.11.3. The SMI# Pin 


External chipsets can generate an SMI based on 
numerous asynchronous events, including power 
management timers, I/O address trapping, external 
devices, audio FIFO events, and others. Since 
SMI# is edge sensitive, the chipset must generate 
an edge for each of the events above, requiring 
arbitration and storage of multiple SMM events. 
These functions are provided by the Cx5520 / 
Cx5530 devices from Cyrix. The processor gener- 
ates an SMI when the external pin changes from 
high-to-low or when an RSM occurs if SMI# has 
not remained low since the initiation of the previous 
SMI. 


3.11.4 SMM Configuration Registers 


The SMAR register specifies the base location of 
SMM code region and its size limit. This SMAR 
register is identical to many of the Cyrix proces- 
sors. 


A new configuration control register called SMHR 
has been added to specify the 32-bit physical 
address of the SMM header. The SMHR address 
must be 32-bit aligned as the bottom two bits are 
ignored by the microcode. Hardware will detect 
write operations to SMHR, and signal the micro- 
code to recompute the header address. Access to 
these registers is enabled by MAPEN (Index 
C3h[4]). 


The SMAR register writes to the SMM header 
when the SMAR register is changed. For this 
reason, changes to the SMAR register should be 
completed prior to setting up the SMM header. The 
configuration registers bit formats are detailed in 
Table 3-11 on page 52. 
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3.11.5 SMM Memory Space Header 


Tables 3-34 and 3-35 show the SMM header. A 
memory address field has been added to the end 
(offset -40h) of the header for the MediaGX 
processor. Memory data will be stored overlapping 
the I/O data, since these events cannot occur 
simultaneously. The I/O address is valid for both IN 
and OUT instructions, and I/O data is valid only for 
OUT. The memory address is valid for read and 
write operations, and memory data is valid only for 
write operations. 


With every SMI interrupt or SMINT instruction, 
selected CPU state information is automatically 
saved in the SMM memory space header located 
at the top of SMM address space. The header 
contains CPU state information that is modified 
when servicing an SMM interrupt. Included in this 
information are two pointers. The current IP points 


Table 3-34 SMM Memory Space Header 


System Management Mode 


to the instruction executing when the SMI was 
detected, but it is valid only for an internal I/O SMI. 


The Next IP points to the instruction that will be 
executed after exiting SMM. The contents of 
Debug Register 7 (DR7), the Extended Flags 
Register (EFLAGS), and Control Register 0 (CRO) 
are also saved. If SMM has been entered due to an 
I/O trap fora REP INSx or REP OUTSx instruction, 
the Current IP and Next IP fields contain the same 
addresses. In addition, the | and P fields contain 
valid information. 


If entry into SMM is the result of an I/O trap, it is 
useful for the programmer to know the port address, 
data size and data value associated with that I/O 
operation. This information is also saved in the 
header and is valid only if SMl# is asserted during an 
I/O bus cycle. The I/O trap information is not restored 
within the CPU when executing a RSM instruction. 


Mem. 
Offset | 31 | 30 | 29 | 28 | 27 | 26 | 25 | 24 | 23 | 22 | 21 | 20| 19) 18/17) 16/15/14/13/12}11)/10;9)/8)/7)/6/;5/4/3;2)1)]0 
-Oh DR7 
—4h EFLAGS 
-8h CRO 
—Ch Current IP 
-10h Next IP 
-14h RSVD CS Selector 
-18h CS Descriptor [63:32] 
-1Ch CS Descriptor [31:0] 
—20h RSVD RSVD N|V|X|M]JH;|S/]P] T/C 
—24h /O Data Size I/O Address [15:0] 
—28h 1/0 (Memory) Data [31:0] 
—2Ch Restored ESI or EDI 
-30h /O or Memory Address [31:0] 
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Table 3-35 SMM Memory Space Header Description 


Name Description Size 
DR7 Debug Register 7: The contents of Debug Register 7. 4 Bytes 
EFLAGS Extended Flags Register: The contents of Extended Flags Register. 4 Bytes 
CRO Control Register 0: The contents of Control Register 0. 4 Bytes 
Current IP Current Instruction Pointer: The address of the instruction executed prior to servicing SMM 4 Bytes 
interrupt. 
Next IP Next Instruction Pointer: The address of the next instruction that will be executed after exit- | 4 Bytes 
ing SMM. 
CS Selector Code Segment Selector: Code segment register selector for the current code segment. 2 Bytes 
CS Descriptor Code Segment Descriptor: Encoded descriptor bits for the current code segment. 8 Bytes 
N Nested SMI Status: Flag that determines whether an SMI occurred during SMM (i.e., nested) 1 Bit 
V SoftVGA SMI Status: SMI was generated by an access to VGA region. 1 Bit 
X External SMI Status: 1 Bit 
If = 1: SMI generated by external SMI# pin 
If = 0: SMI internally generated by Internal Bus Interface Unit. 
M Memory or I/O Access: 0 = I/O access; 1 = Memory access. 1 Bit 
H Halt Status: Indicates that the processor was in a halt or shutdown prior to servicing the SMM 1 Bit 
interrupt. 
Ss Software SMM Entry Indicator: 1 Bit 
If = 1:Current SMM is the result of an SMINT instruction. 
If = 0: Current SMM is not the result of an SMINT instruction. 
P REP INSx/OUTS:x Indicator: 1 Bit 
If = 1: Current instruction has a REP prefix. 
If = 0: Current instruction does not have a REP prefix. 
IN, INSx, OUT, or OUTSx Indicator: 1 Bit 
If = 1: Current instruction performed is an I/O WRITE. 
If = 0: Current instruction performed is an I/O READ. 
Cc CS Writable 1 Bit 
I/O Data Size Indicates size of data for the trapped I/O cycle: 2 Bytes 
01h = byte 
03h = word 
OFh = DWORD 
I/O Address Processor port used for the trapped I/O cycle. 2 Bytes 
I/O Write Data Data associated with the trapped I/O write. 4 Bytes 
Restored ESI or EDI Restored ESI or EDI Value: Used when it is necessary to repeat a REP OUTSx or REP INSx 4 Bytes 
instruction when one of the I/O cycles caused an SMI# trap. 
Memory Address Physical address of the write operation that caused the SMI. 4 Bytes 


Note: INSx = INS, INSB, INSW or INSD instruction. 
OUTSx = OUTS, OUTSB, OUTSW and OUTSD instruction. 
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3.11.6 SMM Instructions 


The MediaGX processor core automatically saves the 
minimal amount of CPU state information when 
entering an SMM cycle that allows fast SMM 
service-routine entry and exit. After entering the 
SMM service routine, the MOV, SVDC, SVLDT and 
SVTS instructions can be used to save the 
complete CPU state information. If the SMM 
service routine modifies more state information than 
is automatically saved or if it forces the CPU to 
power down, the complete CPU state information 
must be saved. Since the CPU is a static device, its 
internal state is retained when the input clock is 
stopped. Therefore, an entire CPU-state save is 
not necessary before stopping the input clock. 


Table 3-36 SMM Instruction Set 


System Management Mode 


The SMM instructions, listed in Table 3-36, can be 
executed only if all the conditions listed below are 
met. 


1) USE_SMI = 1. 

2) SMAR SIZE > 0. 

3) Current Privilege level = 0. 

4) SMAC bit is high or the CPU is in an SMI 


service routine. 


If any one of the conditions above is not met and 
an attempt is made to execute an SVDC, RSDC, 
SVLDT, RSLDT, SVTS, RSTS, or RSM instruction, 
an invalid opcode exception is generated. The 
SMM instructions can be executed outside of 
defined SMM space provided the conditions above 
are met. 


Instruction Opcode Format 


Description 


SvDC OF 78h [mod sreg3 r/m] 


SVDC mem8b0, sreg3 


Save Segment Register and Descriptor 
Saves reg (DS, ES, FS, GS, or SS) to mem80. 


RSDC OF 79h [mod sreg3 r/m] 


RSDC sreg3, mem80 


Restore Segment Register and Descriptor 

Restores reg (DS, ES, FS, GS, or SS) from mem80. Use 
RSM to restore CS. 

Note: Processing “RSDC CS, Mem80’ will produce an 
exception. 


SVLDT OF 7Ah [mod 000 r/m] SVLDT mem80 


Save LDTR and Descriptor 
Saves Local Descriptor Table (LDTR) to mem80. 


RSLDT OF 7Bh [mod 000 r/m] RSLDT mem80 


Restore LDTR and Descriptor 
Restores Local Descriptor Table (LDTR) from mem80. 


SVTS OF 7Ch [mod 000 r/m] SVTS mems0 


Save TSR and Descriptor 
Saves Task State Register (TSR) to mem80. 


RSTS OF 7Dh [mod 000 r/m] RSTS mems0 


Restore TSR and Descriptor 
Restores Task State Register (TSR) from mem80. 


SMINT OF 38h SMINT 


Software SMM Entry 


CPU enters SMM. CPU state information is saved in SMM 
memory space header and execution begins at SMM base 
address. 


RSM OF AAh RSM 


Resume Normal Mode 

Exits SMM. The CPU state is restored using the SMM mem- 
ory space header and execution resumes at interrupted 
point. 


Notes: smem80 = 80-bit memory location. 
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The SMINT instruction can be used by software to 
enter SMM. The SMINT instruction can only be 
used outside an SMM routine if all the conditions 
listed below are true. 


1) USE_SMI = 1 

2) SMAR size > 0 

3) Current Privilege Level = 0 
4) SMAC = 1 


If SMl# is asserted to the CPU during a software 
SMI, the hardware SMI# is serviced after the soft- 
ware SMI has been exited by execution of the RSM 
instruction. 


All the SMM instructions (except RSM and SMINT) 
save or restore 80 bits of data, allowing the saved 
values to include the hidden portion of the register 
contents. 


3.11.7 SMM Memory Space 


SMM memory space is defined by specifying the 
base address and size of the SMM memory space 
in the SMAR register. The base address must be a 
multiple of the SMM memory space size. For 
example, a 32KB SMM memory space must be 
located at a 32KB address boundary. The memory 
space size can range from 4KB to 32MB. Execution 
of the interrupt begins at the base of the SMM 
memory space. 


SMM memory space accesses are always cache- 
able, which allows SMM routines to run faster. 


3.11.8 SMI Generation 


Virtualization software depends on processor- 
specific hardware to generate SMI interrupts for 
each memory or I/O access to the device being 
implemented. The MediaGX processor implements 
SMI generation for VGA accesses. Memory write 
operations in regions A0000h to AFFFFh, BOOOOh 
to B7FFFh, and B8000h to BFFFFh generate an 
SMI. 


Memory reads are not trapped by the MediaGX 
processor. The MediaGX processor traps I/O 
addresses for VGA in the following regions: 3BO0h 
to 3BFh, 3COh to 3CFh, and 3D0h to 3DFh. 
Memory-write trapping is performed during instruc- 
tion decode in the processor core. I/O read and 
write trapping is implemented in the Internal Bus 
Interface Unit of the MediaGX processor. 


The SMl-generation hardware requires two addi- 
tional configuration registers to control and mask 
SMI interrupts in the VGA memory space: 
VGACTL and VGAM. The VGACTL register has a 
control bit for each address range shown above. 
The VGAM register has 32 bits that can selectively 
disable 2KB regions within the VGA memory. The 
VGAM applies only to the AO000h-to-AFFFFh 
region. If this region is not enabled in VGA_CTL, 
then the contents of VGAM is ignored. The 
purpose of VGAM is to prevent SMI from occurring 
when non-displayed VGA memory is accessed. 
This is an enhancement which improves perfor- 
mance for double-buffered applications. The 
format of each register is shown in Chapter 4 of 
this document. 
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3.11.9 SMI Service Routine 


Execution 


Upon entry into SMM, after the SMM header has 
been saved, the CRO, EFLAGS, and DR7 registers 
are set to their reset values. The Code Segment 
(CS) register is loaded with the base, as defined by 
the SMAR register, and a limit of 4 GBytes. The 
SMI service routine then begins execution at the 
SMM base address in real mode. 


The programmer must save the value of any regis- 
ters that may be changed by the SMI service 
routine. For data accesses immediately after 
entering the SMI service routine, the programmer 
must use CS as a segment override. I/O port 
access is possible during the routine but care must 
be taken to save registers modified by the I/O 
instructions. Before using a segment register, the 
register and the register’s descriptor cache contents 
should be saved using the SVDC instruction. 


Hardware interrupts, INTRs and NMIs, may be 
serviced during an SMI service routine. If interrupts 
are to be serviced while executing in the SMM 
memory space, the SMM memory space must be 
within the address range of 0 to 1MB to guarantee 
proper return to the SMI service routine after 
handling the interrupt. 


INTRs are automatically disabled when entering 
SMM since the IF flag (EFLAGS register, bit 9) is 
set to its reset value. Once in SMM, the INTR can 
be enabled by setting the IF flag. An NMI event in 
SMM can be enabled by setting NMI_EN high in 
the CCR register (Index C3h[1]). If NMI is not 
enabled while in SMM, the CPU latches one NMI 
event and services the interrupt after NMI has been 
enabled or after exiting SMM through the RSM 
instruction. The processor is always in real mode in 
SMM, but it may exit to either real or protected 
mode depending on its state when SMM was initi- 
ated. The IDT (Interrupt Descriptor Table) indicates 
which state it will exit to. 


Within the SMI service routine, protected mode 
may be entered and exited as required, and real or 
protected mode device drivers may be called. 


To exit the SMI service routine, a Resume (RSM) 
instruction, rather than an IRET, is executed. The 
RSM instruction causes the MediaGX processor 
core to restore the CPU state using the SMM 
header information and resume execution at the 
interrupted point. If the full CPU state was saved by 
the programmer, the stored values should be 
reloaded before executing the RSM instruction 
using the MOV, RSDC, RSLDT and RSTS instruc- 
tions. 


3.11.9.1 SMI Nesting 


The SMI mechanism supports nesting of SMI inter- 
rupts through the SMI handler, the SMI_NEST bit 
in CCR4[6] (Index E8h), and the Nested SMI 
Status bit (bit N in the SMM header, see Table 3-35 
"SMM Memory Space Header Description" on 
page 91). Nesting is an important capability in 
allowing high-priority events, such as audio virtual- 
ization, to interrupt lower-priority SMI code for VGA 
virtualization or power management. SMI_NEST 
controls whether SMI interrupts can occur during 
SMM. SMI handlers can optionally set SMI_NEST 
high to allow higher-priority SMI interrupts while 
handling the current event. 


The SMI handler is responsible for managing the 
SMI header data for nested SMI interrupts. The 
SMI header must be saved before SMI NEST is 
set high, and SMI_NEST must be cleared and its 
header information restored before an RSM 
instruction is executed. 


The Nested SMI Status bit has been added to the 
SMM header to show whether the current SMI is 
nested. The processor sets Nested SMI Status 
high if the processor was in SMM when the SMI 
was taken. The processor uses Nested SMI Status 
on exit to determine whether the processor should 
stay in SMM. 
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When SMI nesting is disabled, the processor holds 


off external SMI interrupts until the currently 


executing SMM code exits. When SMI nesting is 
enabled, the processor can proceed with the SMI. 


The SMI handler will guarantee that no internal 
SMls are generated in SMM, so the processor 


ignores such events. If the internal and external 
SMI signals are received simultaneously, then the 


internal SMI is given priority to avoid losing the 
event. 


microcode clears SMI_NEST, sets Nested 
SMI Status high and saves the previous value 
of Nested SMI Status (1) in the SMI header. 


E. The second-level SMI handler saves the 


header and sets SMI_NEST to re-enable SMI 
interrupts within SMM. Another level of 
nesting could occur during this period. 


F. The second-level SMI handler clears 


The state diagram of the SMI_NEST and Nested 
SMI Status bits are shown in Figure 3-11 with each 
state is explained next. 


A. When the processor is outside of SMM, 


Nested SMI Status is always clear and 
SMI_NEST is set high. 


B. The first-level SMI interrupt is received by the 


processor. The microcode clears SMI_NEST, 
sets Nested SMI Status high and saves the 
previous value of Nested SMI Status (0) in 
the SMI header. 


. The first-level SMI handler saves the header 
and sets SMI_NEST high to re-enable SMI 
interrupts from SMM. 


. A second-level (nested) SMI interrupt is 
received by the processor. This SMI is taken 
even though the processor is in SMM 
because the SMI_NEST bit is set high. The 


SMI_NEST to disable SMI interrupts, then 
restores its SMI header. 


G. The second-level SMI handler executes an 
RSM. The microcode sets SMI_NEST, and 
restores the Nested SMI Status (1) based on 
the SMI header. 


H. The first-level SMI handler clears SMI_NEST 
to disable SMI interrupts, then restores its 
SMI header. 


|. The first-level SMI handler executes an RSM. 
The microcode sets SMI_NEST high and 
restores the Nested SMI Status (0) based on 
the SMI header. 


When the processor is outside of SMM, Nested 
SMI Status is always clear and SMI_NEST is set 
high. 


SMI_NEST 


Nested SMI Status [__] 
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3.11.9.2 CPU States Related to SMM 
and Suspend Mode 


The state diagram shown in Figure 3-12 illustrates 
the various CPU states associated with SMM and 
Suspend mode. While in the SMI service routine, 
the MediaGX processor core can enter Suspend 
mode either by (1) executing a halt (HLT) instruc- 
tion or (2) by asserting the SUSP# input. 


During SMM operations and while in SUSP#-initi- 
ated Suspend mode, an occurrence of either NMI 
or INTR is latched. (In order for INTR to be latched, 


System Management Mode 


the IF flag, EFLAGS register bit 9, must be set.) 
The INTR or NMI is serviced after exiting Suspend 
mode. 


If Suspend mode is entered through a HLT instruc- 
tion from the operating system or application soft- 
ware, the reception of an SMI# interrupt causes the 
CPU to exit Suspend mode and enter SMM. If 
Suspend mode is entered through the hardware 
(SUSP# = 0) while the operating system or applica- 
tion software is active, the CPU latches one occur- 
rence of INTR, NMI, and SMI#. 


Suspend Mode NMI or INTR 


(SUSPA# = 0) 


Interrupt Service 
Routine 


NMI or INTR 


SUSP# = 0 
RESET OS/Application Suspend Mode 
Software le SUSP# = 1 (SUSPA# = 0) 
a (INTR, NMI and SMI# latched) 
SMI# = 0 SMI# = 0 : 
SMINT* RSM 
Non-SMM Operations 
SMM Operations 
Vv 


SMI Service Routine 
(SMI# = 0) 


NMI or aie 


SUSP# = 0 


Interrupt Service 
Routine 

Suspend Mode 
(SUSPA# = 0) 


*Instructions 
(INTR and NMI latched) 


SUSP# = 1 


a 


Suspend Mode 
(SUSPA# = 0) 


NMI or INTR 


Interrupt Service 
Routine 


Figure 3-12 SMM and Suspend Mode State Diagram 
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3.12 Shutdown and Halt 


The Halt Instruction (HLT) stops program execution 
and generates a special Halt bus cycle. The 
MediaGxX processor core then drives out a special 
Stop Grant bus cycle and enters a low-power 
Suspend mode if the SUSP_HLT bit in CCR2 
(Index C2h[3]) is set. SMI#, NMI, INTR with inter- 
rupts enabled (IF bit in EFLAGS = 1), or RESET 
forces the CPU out of the halt state. If the halt state 
is interrupted, the saved code segment and 
instruction pointer specify the instruction following 
the HLT. 


Shutdown occurs when a severe error is detected 
that prevents further processing. The most 
common severe error is the triple fault, a fault 
event while handling a double fault. Setting the IDT 
or the GDT limit to zero will cause a triple fault. 


An NMI input or a reset can bring the processor out 
of shutdown. An NMI will work if the IDT limit is 
large enough, at least OOOFh, to contain the NMI 
interrupt vector and if the stack has enough room. 
The stack must be large enough to contain the 
vector and flag information (the stack pointer must 
be greater than 0005h). 


3.13 Protection 


Segment protection and page protection are safe- 
guards built into the MediaGX processor’s 
protected-mode architecture that denies unautho- 
rized or incorrect access to selected memory 
addresses. These safeguards allow multitasking 
programs to be isolated from each other and from 
the operating system. This section concentrates on 
segment protection. 


Selectors and descriptors are the key elements in 
the segment protection mechanism. The segment 
base address, size, and privilege level are estab- 
lished by a segment descriptor. Privilege levels 
control the use of privileged instructions, I/O 
instructions and access to segments and segment 
descriptors. Selectors are used to locate segment 
descriptors. 


Segment accesses are divided into two basic 
types, those involving code segments (e.g., control 
transfers) and those involving data accesses. The 
ability of a task to access a segment depends on 
the: 


* segment type 

¢ instruction requesting access 

* type of descriptor used to define the segment 
* associated privilege levels (described next) 


Data stored in a segment can be accessed only by 
code executing at the same or a more privileged 
level. A code segment or procedure can only be 
called by a task executing at the same or a less 
privileged level. 


3.13.1 Privilege Levels 


The values for privilege levels range between 0 
and 3. Level 0 is the highest privilege level (most 
privileged), and level 3 is the lowest privilege level 
(least privileged). The privilege level in real mode is 
zero. 


The Descriptor Privilege Level (DPL) is the privi- 
lege level defined for a segment in the segment 
descriptor. The DPL field specifies the minimum 
privilege level needed to access the memory 
segment pointed to by the descriptor. 


The Current Privilege Level (CPL) is defined as 
the current task’s privilege level. The CPL of an 
executing task is stored in the hidden portion of the 
code segment register and essentially is the DPL 
for the current code segment. 


The Requested Privilege Level (RPL) specifies a 
selector’s privilege level. RPL is used to distinguish 
between the privilege level of a routine actually 
accessing memory (the CPL), and the privilege 
level of the original requester (the RPL) of the 
memory access. If the level requested by RPL is 
less than the CPL, the RPL level is accepted and 
the Effective Privilege Level (EPL) is changed to 
the RPL value. If the level requested by RPL is 
greater than CPL, the CPL overrides the requested 
RPL and EPL becomes the CPL value. 
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The lesser of the RPL and CPL is called the Effective 
Privilege Level (EPL). Therefore, if RPL =O ina 
segment selector, the EPL is always determined by 
the CPL. If RPL = 3, the EPL is always 3 regard- 
less of the CPL. 


For a memory access to succeed, the EPL must be 
at least as privileged as the Descriptor Privilege 
Level (EPL < DPL). If the EPL is less privileged 
than the DPL (EPL > DPL), a general protection 
fault is generated. For example, if a segment has a 
DPL = 2, an instruction accessing the segment 
only succeeds if executed with an EPL < 2. 


3.13.2 I/O Privilege Levels 


The I/O Privilege Level (IOPL) allows the operating 
system executing at CPL = 0 to define the least 
privileged level at which IOPL-sensitive instruc- 
tions can unconditionally be used. The IOPL-sensi- 
tive instructions include CLI, IN, OUT, INS, OUTS, 
REP INS, REP OUTS, and STI. Modification of the 
IF bit in the EFLAGS register is also sensitive to the 
I/O privilege level. 


The IOPL is stored in the EFLAGS register (bits 
[31:12]). An I/O permission bit map is available as 
defined by the 32-bit Task State Segment (TSS). 
Since each task can have its TSS, access to indi- 
vidual I/O ports can be granted through separate 
I/O permission bit maps. 


If CPL < lIOPL, IOPL-sensitive operations can be 
performed. If CPL > IOPL, a general protection 
fault is generated if the current task is associated 
with a 16-bit TSS. If the current task is associated 
with a 32-bit TSS and CPL > IOPL, the CPU 
consults the I/O permission bitmap in the TSS to 
determine on a port-by-port basis whether or not I/O 
instructions (IN, OUT, INS, OUTS, REP INS, REP 
OUTS) are permitted. The remaining IOPL-sensi- 
tive operations generate a general protection fault. 


3.13.3 Privilege Level Transfers 


A task’s CPL can be changed only through inter- 
segment control transfers using gates or task 
switches to a code segment with a different privilege 
level. Control transfers result from exception and 
interrupt servicing and from execution of the CALL, 
JMP, INT, IRET and RET instructions. 


There are five types of control transfers that are 
summarized in Table 3-37. Control transfers can be 
made only when the operation causing the control 
transfer references the correct descriptor type. Any 
violation of these descriptor usage rules causes a 
general protection fault. 


Any control transfer that changes the CPL within a 
task results in a change of stack. The initial values 
for the stack segment (SS) and stack pointer (ESP) 
for privilege levels 0, 1, and 2 are stored in the 
TSS. During a JMP or CALL control transfer, the 
SS and ESP are loaded with the new stack pointer 
and the previous stack pointer is saved on the new 
stack. When returning to the original privilege level, 
the RET or IRET instruction restores the SS and 
ESP of the less-privileged stack. 
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Table 3-37 Descriptor Types Used for Control Transfer 


Descriptor Descriptor 
Type of Control Transfer Operation Types Referenced Table 
Intersegment within the same privilege | JMP, CALL, RET, IRET* Code Segment GDT or LDT 
level. 
Intersegment to the same or a more CALL Gate Call GDT or LDT 


privileged level. Interrupt within task 
(could change CPL level). 


Interrupt Instruction, Exception, 
External Interrupt 


Trap or Interrupt Gate | IDT 


Exception, External Interrupt 


Intersegment to a less privileged level | RET, IRET* Code Segment GDT or LDT 

(changes task CPL). 

Task Switch via TSS CALL, JMP Task State Segment GDT 

Task Switch via Task Gate CALL, JMP Task Gate GDT or LDT 
IRET**, Interrupt Instruction, Task Gate IDT 


Note: “NT =0 (Nested Task bit in EFLAGS, bit 14) 


“*NT =1 (Nested Task bit in EFLAGS, bit 14) 


3.13.3.1 Gates 


Gate descriptors described in Section 3.7.5 “Gate 
Descriptors” on page 76, provide protection for 
privilege transfers among executable segments. 
Gates are used to transition to routines of the same 
or a more privileged level. Call gates, interrupt 
gates and trap gates are used for privilege transfers 
within a task. Task gates are used to transfer 
between tasks. 


Gates conform to the standard rules of privilege. In 
other words, gates can be accessed by a task if the 
effective privilege level (EPL) is the same or more 
privileged than the gate descriptor’s privilege level 
(DPL). 


3.13.4 Initialization and Transition to 


Protected Mode 


The MediaGX processor core switches to real 
mode immediately after RESET. While operating in 
real mode, the system tables and registers should 
be initialized. The GDTR and IDTR must point to a 
valid GDT and IDT, respectively. The size of the IDT 
should be at least 256 bytes, and the GDT must 
contain descriptors that describe the initial code 
and data segments. 


The processor can be placed in protected mode by 
setting the PE bit (CRO register bit 0). After 
enabling protected mode, the CS register should be 
loaded and the instruction decode queue should be 
flushed by executing an intersegment JMP. Finally, 
all data segment registers should be initialized with 
appropriate selector values. 
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3.14 Virtual 8086 Mode 


Both real mode and virtual 8086 (V86) modes are 
supported by the MediaGX processor, allowing 
execution of 8086 application programs and 8086 
operating systems. V86 mode allows the execution 
of 8086-type applications, yet still permits use of 
the paging and protection mechanisms. V86 tasks 
run at privilege level 3. Before entry, all segment 
limits must be set to FFFFh (64k) as in real mode. 


3.14.1. Memory Addressing 


While in V86 mode, segment registers are used in 
an identical fashion to real mode. The contents of 
the Segment register are multiplied by 16 and 
added to the offset to form the Segment Base 
Linear Address. The MediaGX processor permits 
the operating system to select which programs use 
the V86 address mechanism and which programs 
use protected mode addressing for each task. 


The MediaGX processor also permits the use of 
paging when operating in V86 mode. Using paging, 
the 1MB address space of the V86 task can be 
mapped to any region in the 4GB linear address 
space. 


The paging hardware allows multiple V86 tasks to 
run concurrently, and provides protection and oper- 
ating system isolation. The paging hardware must 
be enabled to run multiple V86 tasks or to relocate 
the address space of a V86 task to physical 
address space other than 0. 


3.14.2 Protection 


All V86 tasks operate with the least amount of priv- 
ilege (level 3) and are subject to all CPU protected 
mode protection checks. As a result, any attempt to 
execute a privileged instruction within a V86 task 
results in a general protection fault. 


In V86 mode, a slightly different set of instructions 
are sensitive to the I/O privilege level (IOPL) than 

in protected mode. These instructions are: CLI, INT 
n, IRET, POPF, PUSHF, and STI. The INT3, INTO 


and BOUND variations of the INT instruction are 
not IOPL sensitive. 


3.14.3 Interrupt Handling 


To fully support the emulation of an 8086-type 
machine, interrupts in V86 mode are handled as 
follows. When an interrupt or exception is serviced 
in V86 mode, program execution transfers to the 
interrupt service routine at privilege level 0 (i.e., 
transition from V86 to protected mode occurs). The 
VM bit in the EFLAGS register (bit 17) is cleared. 
The protected mode interrupt service routine then 
determines if the interrupt came from a protected 
mode or V86 application by examining the VM bit 
in the EFLAGS image stored on the stack. The 
interrupt service routine may then choose to allow 
the 8086 operating system to handle the interrupt 
or may emulate the function of the interrupt 
handler. Following completion of the interrupt 
service routine, an IRET instruction restores the 
EFLAGS register (restores VM = 1) and segment 
selectors and control returns to the interrupted V86 
task. 


3.14.4 Entering and Leaving 


Virtual 8086 Mode 


V86 mode is entered from protected mode by 
either executing an IRET instruction at CPL = 0 or 
by task switching. If an IRET is used, the stack 
must contain an EFLAGS image with VM = 1. Ifa 
task switch is used, the TSS must contain an 
EFLAGS image containing a 1 in the VM bit posi- 
tion. The POPF instruction cannot be used to enter 
V86 mode since the state of the VM bit is not 
affected. V86 mode can only be exited as the result 
of an interrupt or exception. The transition out must 
use a 32-bit trap or interrupt gate that must point to 
a non-conforming privilege level 0 segment (DPL = 
0), or a 32-bit TSS. These restrictions are required 
to permit the trap handler to IRET back to the V86 
program. 
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3.15 Floating Point Unit Operations 


The FPU is x87-instruction-set compatible and 
adheres to the IEEE-754 standard. Because most 
applications that contain FPU instructions intermix 
with integer instructions, the MediaGX processor’s 
FPU achieves high performance by completing 
integer and FPU operations in parallel. 


3.15.1 FPU (Floating Point Unit) 


Register Set 


In addition to the registers described to this point, 
the FPU within the CPU provides the user eight 
data registers accessed in a stack-like manner, a 
control register, and a status register. The CPU 
also provides a data register tag word that 
improves context switching and stack performance 
by maintaining empty/non-empty status for each of 
the eight data registers. In addition, registers 
contain pointers to (a) the memory location 
containing the current instruction word and (b) the 
memory location containing the operand associ- 
ated with the current instruction word (if any). 


3.15.2 FPU Tag Word Register 


The CPU maintains a tag word register that is 
divided into eight tag word fields. These fields 
assume one of four values depending on the 
contents of their associated data registers: Valid 
(00), Zero (01), Special (10), and Empty (11). Note: 
Denormal, Infinity, QNaN, SNaN and unsupported 
formats are tagged as “Special”? Tag values are 
maintained transparently by the CPU and are only 
available to the programmer indirectly through the 
FSTENV and FSAVE instructions. The tag word 
with tag fields for each associated physical register, 
tag(n), is shown in Table 3-38. 


3.15.3  FPU Status Register 


The FPU communicates status information and 
operation results to the CPU through the status 
register. The fields in the FPU status register are 
detailed in Table 3-38. These fields include infor- 
mation related to exception status, operation 
execution status, register status, operand class, 
and comparison results. This register is continu- 
ously accessible to the CPU regardless of the state 
of the Control or Execution Units. 


3.15.4 FPU Mode Control Register 


The FPU Mode Control Register (MCR) shown in 
Table 3-38 is used by the MediaGX processor to 
specify the operating mode of the FPU. The MCR 
register fields include information related to the 
rounding mode selected, the amount of precision 
to be used in the calculations, and the exception 
conditions which should be reported to the 
MediaGX processor using traps. The user controls 
precision, rounding, and exception reporting by 
setting or clearing appropriate bits in the MCR. 
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Table 3-38 FPU Registers 


Bit Name Description 


FPU Tag Word Register 


15:14 TAG7 TAG7: 00 = Valid; 01 = Zero; 10 = Special; 11 = Empty. 
13:12 TAG6 TAGE: 00 = Valid; 01 = Zero; 10 = Special; 11 = Empty. 
11:10 TAG5 TAGS: 00 = Valid; 01 = Zero; 10 = Special; 11 = Empty. 
9:8 TAG4 TAG4: 00 = Valid; 01 = Zero; 10 = Special; 11 = Empty. 
7:6 TAG3 TAG3: 00 = Valid; 01 = Zero; 10 = Special; 11 = Empty. 
5:4 TAG2 TAG2: 00 = Valid; 01 = Zero; 10 = Special; 11 = Empty. 
3:2 TAG1 TAG1: 00 = Valid; 01 = Zero; 10 = Special; 11 = Empty. 

1:0 TAGO TAGO: 00 = Valid; 01 = Zero; 10 = Special; 11 = Empty. 

FPU Status Register 

15 B Copy of ES bit (bit 7 this register) 

14 C3 Condition code bit 3 
13:11 iS) Top-of-Stack: Register number that points to the current TOS. 
10:8 C[2:0] Condition code bits [2:0] 

7 ES Error indicator: Set to 1 if unmasked exception detected. 

6 SF Stack Full: FPU Status Register: or invalid register operation bit. 

5 P Precision error exception bit 

4 U Underflow error exception bit 

3 oO Overflow error exception bit 

2 Z Divide-by-zero exception bit 

1 D Denormalized-operand error exception bit 

0 I Invalid operation exception bit 


FPU Mode Control Register 


15:12 RSVD Reserved: Set to 0. 

11:10 RC Rounding Control Bits: 

00 = Round to nearest or even 
01 = Round towards minus infinity 
10 = Round towards plus infinity 
11 = Truncate 

9:8 PC Precision Control Bits: 

00 = 24-bit mantissa 

01 = Reserved 

10 = 53-bit mantissa 

11 = 64-bit mantissa 

7:6 RSVD Reserved: Set to 0. 

P Precision error exception bit 
FPU Mode Control Register 


U 
oO Overflow error exception bit 
Z 
D 
| 


Divide-by-zero exception bit 


Denormalized-operand error exception bit 


O;-+;/Ml]oO!;ARI a 


Invalid-operation exception bit 
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4 Integrated Functions 


The Cyrix MediaGX MMX-Enhanced processor 
integrates a memory controller, graphics pipeline 
and display controller in a Unified Memory Archi- 
tecture (UMA). UMA simplifies system designs and 
significantly reduces overall system costs associ- 
ated with high chip count, small footprint notebook 
designs. Performance degradation in traditional 
UMA systems is reduced through the use of Cyrix’s 
Display Compression Technology™ (DCT™). 


Integrated x86 Solution with MMX™ Support 


4 | 
a 


Figure 4-1 shows the major functional blocks of the 
MediaGX processor and how the Internal Bus 
Interface Unit operates as the interface between 
the processor’s core units and the integrated func- 
tions. 


This section details how the integrated functions 
and Internal Bus Interface Unit operate and their 
respective registers. 


_ yy! Write-Back Integer 
Cache Unit MMU Unit FPU 
C-Bus | | 
e 
Internal Bus Interface Unit 
X-Bus 
® ® 
G M Displ PCI 
raphics emory isplay 
Baan Pipeline Controller Controller Controller 
SDRAM Port Cx5520/Cx5530 PCI Bus 
(CRT/LCD TFT) 
Figure 4-1 Internal Block Diagram 
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4.1 Integrated Functions 
Programming Interface 


The MediaGX processor performs mapping for the 
dedicated cache, graphics pipeline, display 
controller, memory controller, and graphics 
memory, including the frame buffer. It maps these 
to high memory addresses or MediaGX processor 
memory space. The base address for these is 
controlled by the Graphics Configuration Register 
(GCR, Index B8h), which specifies address bits 
[31:30] in physical memory. 


Figure 4-2 shows the address map for the 
MediaGX processor. When accessing the 
MediaGX processor memory space, address bits 
[29:24] must be zero. This allows the MediaGX 
processor a linear address space with a total of 
16MB. Address bit 23 divides this space into 8MB 
for control (bit 23 = 0) and 8MB for graphics 
memory (bit 23 = 1). In control space, bits [22:16] 
are not decoded, so the programmer should set 
them to zero. Address bit 15 divides the remaining 
64KB address space into scratchpad RAM and PCI 
access (bit 15 = 0) and control registers (bit 15 = 


1). 


Table 4-1 GCR Register 


Device drivers must be responsible for performing 
physical-to-virtual memory-address translation, 
including allocation of selectors that point to the 
MediaGX processor. The processor may be 
accessed in protected mode by creating a selector 
with the physical address shown in Table 4-1, and 
a limit of 16MB. A selector with a 64KB limit is large 
enough to access all of the MediaGX processor’s 
registers and scratchpad RAM. 


4.1.1 Graphics Control Register 


The MediaGX processor incorporates graphics 
functions that require registers to implement and 
control them. Most of these registers are memory 
mapped and physically located in the logical units 
they control. The mapping of these units is 
controlled by this configuration register. The 
Graphics Control Register (GCR, Index B8h) is I/O- 
mapped because it must be accessed before 
memory mapping can be enabled. Refer to Section 
3.3.2.2 “Configuration Registers” on page 50 for 
information on how to access this register. 


Bit Name Description 


Index B8h 


GCR Register (R/W) 


Default Value = 00h 


7:4 RSVD Reserved: Set to 0. 


3:2 SP 
00 = OKB 
01 = 2KB 
10 = 3KB 
11 =4KB 


Scratchpad Size: Specifies the size of the scratchpad cache. 


ters. 


MediaGX Base Address: Specifies the physical address for the base (GX_BASE) of the scratchpad 
RAM, the graphics memory (frame buffer, compression buffer, etc.) and the other memory mapped regis- 


00 = Scratchpad RAM, Graphics Subsystem, and memory-mapped configuration registers are disabled. 
01 = Scratchpad RAM and control registers start at GX_BASE = 40000000h. 
10 = Scratchpad RAM and control registers start at GX_BASE = 80000000h. 
11 = Scratchpad RAM and control registers start at GX_BASE = C0000000h. 
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Physical Address Map 


AOMIAGEaSS paeeiaa (4GB) 
(256KB) 


FFFC0000h 
PCI Access 


GX_BASE+1000000h 
Graphics Memory 
(Frame Buffer, etc.) 


GX_BASE+800000h 
SMM System Code 


Power Management Registers 
(See Table 6-1 on page 209 


) 
Memory Controller Registers 
Table 4-1 121 
FEFF FFFFh (See Table 4-15 on page ) 
MAX Display Controller Registers 
(See Table 4-30 on page 154) 
Graphics Memory 
(Frame Buffer, etc.) Graphics Pipeline Registers 
(See Table 4-24 on page 139) 
GX_BASE+8100h 
Internal Bus IF Unit Registers 
(See Table 4-9 on page 113) 


GX_BASE+8000h 
PCI Access 


Scratchpad 
(See Table 4-5 on page 109) 


GX_BASE+400000h 
GX_BASE+9000h 


GX_BASE+8500h 
DRAM Map 


GX_BASE+8400h 


GX_BASE+8300h 


GX_BASE+1000h 


PCI Access GX_BASE (See Table 


“Top of DRAM 
100000h (1MB) 100000h (1MB) 
E8000h E8000h 


E0000h E0000h 
UMBs and . 
Expansion ROMs UMBs and Expansion ROMs 


VGA/MDA 


SMM System Code Frame Buffers 
(Soft VGA and/or PCA/ISA 


A0000h (640KB) 
Conventional Memory Conventional Memory 
Oh 


* See BC_DRAM_TOP Table 4-10 on page 114 or MC_GBASE_ADD Table 4-16 on page 122. 


*GBADD or Top of DRAM 


C0000h C0000h 


A0000h (640KB) 


Oh 


Figure 4-2 MediaGX Processor Memory Space 
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4.1.2 Control Registers 


The control registers for the MediaGX processor 
use 32KB of the memory map, starting at 
GX_BASE+8000h (see Figure 4-2). This area is 
divided into Internal Bus Interface Unit, Graphics 
Pipeline, Display Controller, Memory Controller, 
and Power Management sections: 


¢ The Internal Bus Interface Unit maps 100h loca- 
tions starting at GX_BASE+8000h. 


* The Graphics Pipeline maps 200h locations 
starting at GX_BASE+8100h. 


« The Display Controller maps 100h locations 
starting at GX_BASE+8300h. 


« The Memory Controller maps 100h locations 
starting at GX_BASE+8400h 


* GX_BASE+8500h-8FFFh is dedicated to power 
management registers for the serial packet 
transmission control, the user-defined power 
management address space, Suspend Refresh, 
and SMI status for Suspend/Resume. 


The register descriptions are contained in the indi- 
vidual subsections of this chapter. Accesses to 
undefined registers in the MediaGX processor 
control register space will not cause a hardware 
error. 


4.1.3 Graphics Memory 


The MediaGX processor’s graphics memory is 
mapped into 8MB starting at GX_BASE+800000h. 
This area includes the frame buffer memory and 
storage for internal display controller state. The 
frame buffer is a linear map whose size depends 
on the current resolution setup in the memory 
controller. Frame buffer scan lines are not contig- 
uous in many resolutions, so software that renders 
to the frame buffer must use a skip count to 
advance between scan lines. The display controller 
uses the graphics memory that lies between scan 
lines for internal state. For this reason, accessing 
graphics memory between the end of a scan line 


and the start of another can cause display prob- 
lems. The skip count for all supported resolutions is 
shown in Table 4-2. 


Graphics memory is allocated from system DRAM 
by the system BIOS. The graphics memory size is 
programmed by setting the graphics memory base 
address in the memory controller. Display drivers 
communicate with system BIOS about resolution 
changes, to ensure that the correct amount of 
graphics memory is allocated. When a graphics 
resolution change requires an increased amount of 
graphics memory, the system must be rebooted! 
The reason for this restriction is that no mechanism 
exists to recover system DRAM from the operating 
system without rebooting. 


Table 4-2. Display Resolution Skip Counts 


Screen Pixel Skip 
Resolution Depth Count 
640x480 8 bits 1024 
640x480 16 bits 2048 
800x600 8 bits 1024 
800x600 16 bits 2048 
1024x768 8 bits 1024 
1024x768 16 bits 2048 
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4.1.4 L1 Cache Controller 


The MediaGX processor contains an on-board 
16KB unified data/instruction L1 cache. It operates 
in write-back mode. Since the memory controller is 
also on-board, the L1 cache requires no external 
logic to maintain coherency. All DMA cycles auto- 
matically snoop the L1 cache. For improved 
graphics performance, part of the L1 cache oper- 
ates as a scratchpad RAM to be used by the 
graphics pipeline as a BLT Buffer. 


The CD bit (Cache Disable, bit 30) in CRO globally 
controls the operating mode of the L1 cache. LCD 
and LWT, Local Cache Disable and Local Write- 
through bits in the Translation Lookaside Buffer, 
control the mode on a page-by-page basis. Addi- 
tionally, memory configuration control can specify 
certain memory regions as non-cacheable. 


Write-back caching improves performance by 
relieving congestion on slower external buses. 
With four dirty bits, the cache marks dirty locations 
on a double-word basis. This further reduces the 
number of double-word bus write operations 
needed during a replacement or flush operation. 


Table 4-3 L1 Cache BitBLT Register Summary 


The MediaGxX processor will cache SMM regions. 
This speeds up system management overhead to 
allow for hardware emulation such as VGA. 


The cache of the MediaGX processor provides the 
ability to redefine 2KB, 3KB, or 4KB of the L1 
cache to be scratchpad memory. The scratchpad 
area is memory mapped to the upper memory 
region defined by the GCR register (Index B8h). 
The valid bits for the scratchpad RAM will always 
be true and the scratchpad RAM locations will 
never be flushed to memory. The scratchbad RAM 
serves as a general purpose high speed RAM and 
as a BLT buffer for the graphics pipeline. Incre- 
menting BLT buffer address registers have been 
added to enable the graphics pipeline to access 
this memory as a BLT buffer. A 16-byte line buffer 
dedicated to the graphics pipeline accesses has 
been added to minimize graphics interference with 
normal CPU operation. 


Table 4-3 summarizes the registers contained in 
the L1 cache. These registers do not have default 
values and must be initialized before use. Table 4-4 
gives the register/bit formats. 


Mnemonic Name Function 


L1_BBO BASE 
L1 Cache BitBLT 0 Base Address 


Contains the address offset to the first byte of BLT Buffer 0 in the 
scratchpad memory. 


L1_BBO POINTER 
L1 Cache BitBLT 0 Pointer 


Contains the address offset to the current line of BLT Buffer 0 in the 
scratchpad memory. 


Li_BB1_BASE 


L1 Cache BitBLT 1 Base Address memory. 


Contains the offset to the first byte of BLT Buffer 1 in the scratchpad 


L1_BB1_POINTER 
L1 Cache BitBLT 1 Pointer 


Contains the address offset to the current line of BLT Buffer 1 in the 
scratchpad memory. 


Note: 
on page 111. 


For information on accessing these registers, refer to Section 4.1.6 “CPU_READ/CPU_WRITE Instructions” 
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Table 4-4 L1 Cache BitBLT Registers 


Bit Name Description 
L1_BBO_BASE Register (R/W) Default Value = None 
15:12 RSVD Reserved: Set to 0. 
11:4 INDEX BitBLT 0 Base Index: The index to the starting line of BLT Buffer 0. 
3:0 BYTE BitBLT 0 Starting Byte: Determines which byte of the starting line is the beginning of BLT Buffer 0. 
L1_BBO_POINTER Register (R/W) Default Value = None 
15:12 RSVD Reserved: Set to 0. 
11:4 INDEX BitBLT 0 Pointer Index: The index to the current line of BLT Buffer 0. 
3:0 RSVD Reserved: Set to 0. 
L1_BB1_Base Register (R/W) Default Value = None 
15:12 RSVD Reserved: Set to 0. 
11:4 INDEX BitBLT 1 Base Index: The index to the starting line of BLT Buffer 1. 
3:0 BYTE BitBLT 1 Starting Byte: Determines which byte of the starting line is the beginning of BLT Buffer 1. 
L1_BB1_POINTER Register (R/W) Default Value = None 
15:12 RSVD Reserved: Set to 0. 
11:4 INDEX BitBLT 1 Pointer Index: The index to the current line of BLT Buffer 1. 
3:0 RSVD Reserved: Set to 0. 
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4.1.4.1 Scratchpad Memory 


The scratchpad RAM is a dedicated high-speed 
memory cache that contains BLT buffers, SMM 
header, and a scratchpad area for display drivers. 
It provides both L1 cache performance and a dedi- 
cated resource that cannot be thrown out by other 
system activity. The configuration of the scratchpad 
is based on graphics resolution and is described in 
Table 4-5. 


The scratchpad memory is part of the on-chip L1 
cache memory. The memory size is controlled by 
bits in the GCR register (Index B8h). The 
scratchpad memory can be disabled, or sized to 
2KB, 3KB, or 4KB. The remaining L1 cache size is 
16KB minus the scratchpad size, and all of the 
scratchpad area is subtracted from a single way. 


The scratchpad memory is used by display drivers 
and virtualization software. Because this resource 
must be tightly controlled to avoid conflicts, appli- 
cation software and third-party drivers should avoid 
accesses to the scratchpad area. 


The display driver creates and manages two BLT 
buffers from within the scratchpad area. These BLT 
buffers are used to transfer source data from 


Table 4-5 Scratchpad Organization 


system memory into the frame buffer, or for desti- 
nation data from system memory or the frame 
buffer. The graphics pipeline accesses the BLT 
buffers for many common operations, including 
BitBLT transfers, output primitives, and raster text. 
Display drivers also use a small portion of the 
scratchpad as an extended register file, since 
scratchpad read and write accesses are very fast 
compared to normal memory operations. 


The virtualization software uses the scratchpad 
area to store critical SMM information, including 
the SMI header and SMM system state. No SMM 
code currently resides in the scratchpad area, 
although this is an option for future products. 


When the BLT buffer pointer is used (refer to Table 
4-8) addresses outside the scratchpad range will 
wrap around back into the scratchpad RAM. Table 
4-5 shows the allocation of scratchpad memory for 
the 2KB and 3KB configurations of the scratchpad. 
The 2KB configuration uses GX_BASE+0800h to 
GX_BASE+1000h. The 3KB configuration uses 
GX_BASE+0400h to GX_BASE+1000h. These 
configurations are fixed by the system BIOS during 
boot and cannot be changed without rebooting the 
system. 


2KB Configuration 3KB Configuration 
Offset Size Offset Size Description 
GX_BASE + OEEOh 288 bytes GX_BASE + OEEOh 288 bytes SMM scratchpad 
GX_BASE + OE60h 128 bytes GX_BASE + OE60h 128 bytes Driver scratchpad 
GX_BASE + 0B30h 816 bytes GX_BASE + 0930h 1328 bytes BLT Buffer 0 
GX_BASE + 0800h 816 bytes GX_BASE + 0400h 1328 bytes BLT Buffer 1 
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4.1.5 Display Driver Instructions 


The MediaGX processor has four instructions to 
access processor core registers. Table 4-6 shows 
these instructions. 


Adding CPU instructions does not create a 
compatibility problem for applications that may 
depend on receiving illegal opcode traps. The solu- 
tion is to make these instructions generate an 
illegal opcode trap unless a compatibility bit is 
explicitly set. The MediaGX processor uses the 


Table 4-6 Display Driver Instructions 


scratchpad size field (bits [3:2] in GCR, Index B8h) 
to enable or disable all of the graphics instructions. 
If the scratchpad size bits are zero, meaning that 
none of the cache is defined as scratchpad, then 
hardware will assume that the graphics controller is 


not being used and the graphics instructions will be 
disabled. Any other scratchpad size will enable all 


of the new instructions. Note that the base address 
of the memory map in the GCR register can still be 
set up to allow access to the memory controller 
registers. 


Syntax Opcode Description 

BBO_RESET OF3A Reset the BLT Buffer 0 pointer to the base. 

BB1_RESET OF3B Reset the BLT Buffer 1 pointer to the base. 

CPU_WRITE OF3C Write data to CPU internal register. 

CPU_READ OF3D Read data from CPU internal register. 
— SSS 
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4.1.6 CPU_READ/CPU_WRITE 


Instructions 


The MediaGX processor has several internal regis- 
ters that control the BLT buffer and power manage- 
ment circuitry in the dedicated cache subsystem. 
To avoid adding additional instructions to read and 
write these registers, the MediaGX processor has 
a general mechanism to access internal CPU 
registers with reasonable performance. The 
MediaGX processor has two special instructions to 
read and write CPU registers: CPU_READ and 
CPU_WRITE. Both instructions fetch a 32-bit 
register address from EBX as shown in Table 4-7 
and Table 4-8. CPU_WRITE uses EAX for the 


Table 4-7 CPU-Access Instructions 


source data, and CPU_READ uses EAX as the 
destination. Both instructions always transfer 32 
bits of data. 


These instructions work by initiating a special I/O 
transaction where the high address bit is set. This 
provides a very large address space for internal 
CPU registers. 


The BLT buffer base registers define the starting 
physical addresses of the BLT buffers located 
within the dedicated L1 cache. The dedicated 
cache can be configured for up to 4KB, so 12 
address bits are required for each base address. 


Syntax Opcode Registers Length 
CPU_WRITE OF3Ch EBX = 32-bit address, EAX = Source 2 bytes 
CPU_READ OF3Dh EBX = 32-bit address, EAX = Destination 2 bytes 


Table 4-8 Address Map for CPU-Access Registers 


Register EBX Address Description 

L1_BBO_BASE FFFF FFOCh BLT Buffer 0 base address (see Table 4-4 on page 108). 

L1_BB1_BASE FFFF FF1Ch BLT Buffer 1 base address (see Table 4-4 on page 108). 

L1_BBO_POINTER FFFF FF2Ch BLT Buffer 0 pointer address (see Table 4-4 on page 108). 

L1_BB1_POINTER FFFF FF3Ch BLT Buffer 1 pointer address (see Table 4-4 on page 108). 

PM_BASE FFFF FF6Ch Power management base address (see Table 6-3 on page 212). 

PM_MASK FFFF FF7Ch Power management address mask (see Table 6-3 on page 212). 
— SSS 
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4.2 Internal Bus Interface Unit 


The MediaGX processor’s Internal Bus Interface 
Unit provides control and interface functions to the 
internal C-Bus (processor core, FPU, graphics 
pipeline, and L1 cache) and X-Bus (PCI controller, 
display controller, memory controller, and graphics 
accelerator) paths, provides control for several 
sections of memory, and plays an important part in 
the Virtual VGA function. 


The Internal Bus Interface Unit performs, without 
loss of compatibility, the functions that previously 
required the external pins IGNNE# and A20M#. 


The Internal Bus Interface Unit provides configura- 
tion control for up to 20 different regions within 
system memory. It provides 19 configurable 
memory regions in the address space between 
640KB and 1MB, with separate control for read 
access, write access, cacheability, and PCI 
access. 


The memory configuration control includes a top- 
of-memory register and hardware support for VGA 
emulation plus, the capability to program 20 
regions of the memory map for different ROM 
configurations, and to locate memory-mapped I/O. 


4.2.1 FPU Error Support 


The FERR# (floating point error) and IGNNE# 
(ignore numeric error) pins of the 486 micropro- 
cessor have been replaced with an IRQ13 (inter- 
rupt request 13) pin. In DOS systems, FPU errors 
are reported by the external vector 13. This mode 
of operation is specified by clearing the NE bit (bit 
5) in the CRO register. If the NE bit is active, the 
IRQ13 output of the MediaGX processor is always 
driven inactive. If the NE bit is cleared, the 
MediaGX processor drives IRQ13 active when the 
ES bit (bit 7) in the FPU Status Register is set high. 
Software must respond to this interrupt with an 
OUT instruction of an 8-bit operand to FOh or Fth. 
When the OUT cycle occurs, the IRQ13 pin is 
driven inactive and the FPU starts ignoring numeric 
errors. When the ES bit is cleared, the FPU 
resumes monitoring numeric errors. 


4.2.2 A20M Support 


The MediaGX processor provides an A20M bit in 
the BC_XMAP_1 Register (GX_BASE+ 8004h[21]) 
to replace the A20M# pin on the 486 micropro- 
cessor. When the A20M bit is set high, all non-SMI 
accesses will have address bit 20 forced to zero. 
External hardware must do an SMI trap on I/O 
locations that toggle the A20M# pin. The SMI soft- 
ware can then change the A20M bit as desired. 


This will maintain compatibility with software that 
depends on wrapping the address at bit 20. 


4.2.3 SMI Generation 


The Internal Bus Interface Unit can generate SMI 
interrupts whenever an I/O cycle in the VGA 
address range is 3BOh-3BFh and 3C0h-3CFh. An 
I/O cycle to 3D0h-3DFh can be trapped. In case an 
external VGA card is present, the Internal Bus 
Interface Unit default values will not generate an 
interrupt on VGA accesses. (Refer to Section 
5.2.3.1 “SMI Generation” on page 195 for instruc- 
tions on how to configure the registers to generate 
the SMI interrupt.) 


4.2.4 640KB to 1MB Region 


There are 19 configurable memory regions located 
between 640KB and 1MB. Three of the regions are 
A0000h-AFFFFh, BOOOOh-B7FFFh, and B8000h- 
BFFFFh. The area between CO000h and FFFFFh 
is divided into 16KB segments to form the 
remaining 16 regions. Each of these regions has 
four control bits to allow any combination of read- 
access, write-access, cache, and PCl-access 
capabilities (Table 4-11 on page 115). 


In addition, each of the three regions defined in the 
A0000h-BFFFh area of memory has a VGA control 
bit that can cause the graphics pipeline to handle 
accesses to that section of memory (see Table 5-3 
on page 197). 
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4.2.5 Internal Bus Interface Unit Table 4-9 summarizes the four 32-bit registers 
Registers contained in the Internal Bus Interface Unit and 


The Internal Bus Interface Unit maps 100h loca- Tabled!) gives MeregISiSUDILleimals: 


tions starting at GX_BASE+8000h. Refer to 
Section 4.1.2 “Control Registers” on page 106 for 
instructions on accessing these registers. 


Table 4-9 Internal Bus Interface Unit Register Summary 


GX_BASE+ Default 
Memory Offset Type | Name/Function Value 
8000h-8003h R/W_ | BC_DRAM_TOP 3FFFFFFFh 


Top of DRAM — Contains the highest available address of system memory not 
including the memory that is set aside for graphics memory, which corresponds 
to 1 GByte of memory. The largest possible value for the register is 
3FFFFFFFh. 


8004h-8007h R/W BC_XMAP_1 00000000h 
Memory X-Bus Map Register 1 (A and B Region Control) — Contains the region 
control of the A and B regions and the SMI controls required for VGA emulation. 
PCI access to internal registers and the A20M function are also controlled by 
this register. 

8008h-800Bh R/W BC_XMAP_2 00000000h 
Memory X-Bus Map Register 2 (C and D Region Control) — Contains region 
control fields for eight regions in the address range COh through DCh. 
800Ch-800Fh R/W BC_XMAP_3 00000000h 


Memory X-Bus Map Register 3 (E and F Region Control) — Contains the region 
control fields for memory regions in the address range EOh through FCh. 
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Table 4-10 Internal Bus Interface Unit Registers 


Bit Name Description 
GX_BASE+8000h-8003h BC_DRAM_TOP Register (R/W) Default Value = 3FFFFFFFh 
31:30 RSVD Reserved: Set to 0. 
29:17 TOP OF Top of DRAM: Maximum value is FFFh. 
DRAM 
16:0 1FFFF Granularity: Must be set to 1FFFFh (128KB). 
GX_BASE+8004h-8007h BC_XMAP_1 Register (R/W) Default Value = 00000000h 
31:29 RSVD Reserved: Set to 0. 
28 GEB8 Graphics Enable for B8 Region — Allow memory R/W operations for address range B8000h- 


BFFFFh be directed to the graphics pipeline: 0 = Disable; 1 = Enable. 
(Used for VGA emulation.) 


27:24 B8 B8 Region: Region control field for address range B8000h-BFFFFh. 
Note: Refer to Table 4-11 for decode. 
23 RSVD Reserved: Set to 0. 
22 PRAE PCI Register Access Enable: Allow PCI Slave to access internal registers on the X-Bus: 
0 = Disable; 1 = Enable. 
21 A20M Address Bit 20 Mask: Address bit 20 is always forced to a zero except for SMI accesses: 
0 = Disable; 1 = Enable. 
20 GEBO Graphics Enable for BO Region: Allow memory R/W operations for address range BOOOOh-B7FFFh 


be directed to the graphics pipeline: 0 = Disable; 1 = Enable. 
(Used for VGA emulation.) 


19:16 BO BO Region: Region control field for address range BOO00h-B7FFFh. 
Note: Refer to Table 4-11 for decode. 
15 SMID SMID: All I/O accesses for address range 3D0h-3DFh generate an SMI: 0 = Disable; 1 = Enable. 
(Used for VGA virtualization.) 
14 SMIC SMIC: All I/O accesses for address range 3COh-3CFh generate an SMI: 0 = Disable; 1 = Enable. 
(Used for VGA virtualization.) 
13 SMIB SMIC: All I/O accesses for address range 3COh-3CFh generate an SMI: 0 = Disable; 1 = Enable 
(Used for VGA virtualization.) 
12:8 RSVD Reserved — Set to 0. 
7 XPD X-Bus Pipeline Disable: When cleared, the address for the next cycle can be driven on the internal 


X-Bus before the completion of the data phase of the current cycle. 


6 GNWS X-Bus Graphics Pipe No Wait State: Data driven on X-Bus from graphics pipeline: 
0 = 1 full clock before X_DSX is asserted 
1 = On the same clock in which X_RDY is asserted 


5 XNWS X-Bus No Wait State:— Data driven on X-Bus from Internal Bus Interface Unit: 
0 = 1 full clock before X_DSX is asserted 
1 = On the same clock in which X_RDY is asserted 


4 GEA Graphics Enable for A Region: Memory R/W operations for address range AOO0Oh-AFFFFh are 
directed to the graphics pipeline: 0 = Disable; 1 = Enable. 

(Used for VGA emulation.) 

3:0 AO AO Region: Region control field for address range AOO00h-AFFFFh. 

Note: Refer to Table 4-11 for decode. 


Page 114 Cyrix Corporation Confidential GXm_db_v2.0 


Internal Bus Interface Unit A | 


Table 4-10 Internal Bus Interface Unit Registers 


Bit Name Description 
GX_BASE+8008h-800Bh BC_XMAP_2 Register (R/W) Default Value = 00000000h 
31:28 DC DC Region: Region control field for address range DC000h to DFFFFh. 
27:24 D8 D8 Region: Region control field for address range D8000h to DBFFFh. 
23:20 D4 D4 Region: Region control field for address range D4000h to D7FFFh. 
19:16 DO DO Region: Region control field for address range DO000h to D3FFFh. 
15:12 CC CC Region: Region control field for address range CC000h to CFFFFh. 
11:8 C8 C8 Region: Region control field for address range C8000h to CBFFF. 
7:4 C4 C4 Region: Region control field for address range C4000h to C7FFFh. 
3:0 co CO Region: Region control field for address range CO000h to C3FFFh. 


Note: Refer to Table 4-11 for decode. 


GX_BASE+800Ch-800Fh BC_XMAP_3 Register (R/W) Default Value = 00000000h 

31:28 FC FC Region: Region control field for address range FCO000h to FFFFFh. 
27:24 F8 F8 Region: Region control field for address range F8000h to FBFFFh. 
23:20 F4 F4 Region: Region control field for address range F4000h to F7FFFh. 
19:16 FO FO Region: Region control field for address range FOO000h to F3FFFh. 
15:12 EC EC Region: Region control field for address range EC000h to EFFFFh. 
11:8 E8 E8 Region: Region control field for address range E8000h to EBFFFh. 
7:4 E4 E4 Region: Region control field for address range E4000h to E7FFFh. 

3:0 EO EO Region: Region control field for address range E0000h to E3FFFh. 


Note: Refer to Table 4-11 for decode. 


Table 4-11 Region-Control-Field Bit Definitions 


Bit 
Position | Function 


3 PCI Accessible:— The PCI slave can access this memory if this bit is set high and if the appropriate Read or Write 
Enable bit is also set high. 
2 Cache Enable: Caching this region of memory is inhibited if this bit is cleared. 


Write Enable: Write operations to this region of memory are allowed if this bit is set high. If this bit is cleared, then 
write operations in this region are directed to the PCI master. 

0 Read Enable: Read operations to this region of memory are allowed if this bit is set high. If this bit is cleared then read 
operations in this region are directed to the PCI master. 
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4.3 Memory Controller The SDRAM clock is a function of the core clock. 
The SDRAM bus can be run at speeds that range 
between 66MHz and 100MHZz. The core clock can 
be divided down by 2, 2.5, 3, 3.5, or 4 to generate 
the SDRAM clock. 


The memory controller operates with the Processor 
Interface (X-Bus), Display Controller Interface, 
Graphics Pipeline Interface, and the SDRAM Inter- 
face. 


A basic block diagram of the memory controller is 


The MediaGX processor supports LVTTL (low eat 
shown in Figure 4-3. 


voltage TTL) technology. LVTTL technology allows 
the SDRAM interface of the memory controller to 
run at frequencies up to 125MHz. 


REF 
ar a fee Processor I/F DQMI7:0] 
Eee een en 
; SDRAM + ant CASA#,CASB# 
Display Controller P Timing 4 
Control Display Controller I/F Arbiter Sequence Controller CSI3:0}# 


Controller 
WEA#/WEB# 


CKEA#, CKEB# 


Graphics oe Graphics Pipeline I/F 


Processor/PCI Address . 
P| Address ee 


Display Controller Address Control/MUX 
Graphics Pipeline Address BA[1:0] 


Processor/PCl 
Eipeeeson EEbDal Write Buffer (16 Bytes) 
: Display Controller 
Display Controller Data Write Buffer (16 Bytes) sists 
: eT Graphics Controller 
Graphics Pipeline Data Wie Bante (16 Bytes) 


Read Buffer 
(16 Bytes) 
Clock Divid 
Core Clock (ph2) Boe ae aie SDCLK[3:0] 


Figure 4-3 Memory Controller Block Diagram 
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4.3.1 Memory Array Configuration For example, 16Mb SDRAMS have two component 
The memory controller supports up to two 64-bit, banks and 64Mb SDRAMs have two or four 
168-pin unbuffered SDRAM modules (DIMM). component banks. For single DIMM bank modules, 
Each DIMM receives a unique set of RAS, CAS the memory controller can support two DIMMS with 
WE, and CKE lines. Each DIMM can have one or a maximum of eight component banks. For dual 
iwo 64-bit DIMM banks. Each DIMM bank is DIMM bank modules, the memory controller can 


selected by a unique chip select (CS). There are support two DIMMs with a maximum of 16 compo- 
four chip select signals to choose between a total nent banks. Up to 16 banks can be open at the 

of four DIMM banks. Each DIMM bank also same time. 

receives a unique SDCLK. Each DIMM bank can 

have two or four component banks. Component 

bank selection is done through the bank address 

(BA) lines. 


Bank 0 Bank 1 
MA[12:0] ——————en A[12:0] A[12:0] 
ad BAI1:0] 
MD[63:0] 
: DQMI7:0] 
RASA# RAS# 
CASA# CAS# 
WEA# WE# 
CS0# SO#, S2# | 
CS1# S1#, S3# 
CKEA CKEO | CKE1 
SDCLKO CKO, Ck2 | 
SDCLK1 CK1, CK3 


MediaGX™ 
MMX™-Enhanced 


Processor BankO - Bank 1 
Ali2:0) | alt2:0] 
BA[1:0] | BA[1:0] 
MD[63:0] _, MD[63:0] 
pami7:0] | pami7:0] 
RASB# RAS# RAS# 
CASB# CAS# CAS# 
WEB# WE# WE# 
CS2# SO#, S2# | 
CS3# S1#, S3# 


CKEB CKEO | CKE1 
SDCLK2 CKO, CK2 | 
SDCLK3 CK1, CK3 


Figure 4-4 Memory Array Configuration 
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4.3.2 Memory Organizations tions. Supported configurations are shown in Table 
The memory controller supports JEDEC standard 4-12. 
synchronous DRAMSs in 16Mb and 64Mb configura- 
Table 4-12 Synchronous DRAM Configurations 
Row Column Bank Total # of 
Depth Organization Address Address Address Address bits 
1 1Mx16 A10-A0 A7-A0 BAO 20 
2 2Mx8 A10-A0 A8-A0 BAO 21 
2Mx32 A10-A0 A7-A0 BA1-BAO 21 
2Mx32 A10-A0 A8-A0 BAO 21 
2Mx32 A11-A0 A6-A0 BA1-BAO 21 
2Mx32 A12-A0 A6-A0 BAO 21 
4 4Mx4 A10-A0 A9-A0 BAO 22 
4Mx16 A11-A0 A7-A0 BA1-BAO 22 
4Mx16 A12-A0 A7-A0 BAO 22 
4Mx16 A10-A0 A9-A0 BAO 22 
8 8Mx8 A11-A0 A8-A0 BA1-BAO 23 
8Mx8 A12-A0 A8-A0 BAO 23 
8Mx32 A11-A0 A8-A0 BA1-BAO 23 
8Mx32 A12-A0 A7-A0 BA1-BAO 23 
16 16Mx4 A11-A0 A9-A0 BA1-BAO 24 
16Mx4 A12-A0 A9-A0 BAO 24 
16Mx16 A12-A0 A8-A0 BA1-BAO 24 
16Mx16 A11-A0 A9-A0 BA1-BAO 24 
32 32Mx8 A12-A0 A9-A0 BA1-BAO 25 
64 64Mx4 A12-A0 A9-A0,A11 BA1-BAO 26 
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4.3.3 SDRAM Commands 

This subsection discusses the SDRAM commands 
supported by the memory controller. Table 4-13 
summarizes these commands followed by detailed 
operational information regarding each command. 


Table 4-13 Basic Command Truth Table 


Name Command CS | RAS | CAS | WE 

MRS Mode Register Set L L L L 

PRE Bank Precharge L L. H L 

ACT Bank activate/row- L L H H 
address entry 

WRT Column address L H L L 
entry/Write operation 

READ | Column address L H L H 
entry/Read operation 

DESL Control input inhibit/ H X X X 
No operation 

REFR* | CBR Refresh or Auto L L L H 
Refresh 


Note: *This command is CBR (CAS-before-RAS) refresh 
when CKE is high and self refresh when CKE is low. 


MRS — The Mode Register command defines the 
specific mode of operation of the SDRAM. This 
definition includes the selection of burst length, 
burst type, and CAS latency. CAS latency is the 
delay, in clock cycles, between the registration of a 
read command and the availability of the first piece 
of output data. 


The burst length is programmed by address bits 
MA[2:0], the burst type by address bit MA3 and the 
CAS latency by address bits MA[6:4]. 


The memory controller only supports a burst length 
of two and burst type of interleave. 


The field value on MA[12:0] and BA[1:0] during the 
MRS cycle are as shown in Table 4-14. 


PRE — The precharge command is used to deacti- 
vate the open row in a particular bank or the open 
row in both component banks. Address pin MA10 
determines whether one or both banks are to be 
precharged. In the case where only one compo- 
nent bank is to be precharged, BA[1:0] selects 
which bank. Once a bank has been precharged, it 
is in the Idle state and must be activated prior to 
any read or write commands. 


Table 4-14 Address Line Programming during MRS Cycles 


BA[1:0] MA[12:7] MAI[6:4] 


MA3 MA2 MA1 MAO 


00 000000 CAS Latency: 
000 = Reserved 
010 =2CLK 
100 = 4 CLK 
110 =6 CLK 
001 =1 CLK 
011 =3 CLK 
101 =5 CLK 
111 =7 CLK 
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ACT — The activate command is used to open a 
row in a particular bank for a subsequent access. 
The value on the BA lines selects the bank, and the 
address on the MA lines selects the row. This row 
remains open for accesses until a precharge 
command is issued to that bank. A precharge 
command must be issued before opening a 
different row in the same bank. 


READ — The read command is used to initiate a 
burst read access to an active row. The value on 
the BA lines select the component bank, and the 
address provided by the MA lines select the 
starting column location. The memory controller 
does not perform auto precharge during read oper- 
ations. Valid data-out from the starting column 
address is available following the CAS latency after 
the read command. The DQM signals are asserted 
low during read operations. 


WRT — The write command is used to initiate a 
burst write access to an active row. The value on 
the BA liens select the component bank, and the 
address provided by the MA lines select the 
starting column location. The memory controller 
does not perform auto precharge during write oper- 
ations. This leaves the page open for subsequent 
accesses. Data appearing on the MD lines is 
written to the DQM logic level appearing coincident 
with the data. If the DQM signal is registered low, 
the corresponding data will be written to memory. If 
the DQM is driven high, the corresponding data will 
be ignored, and a write will not be executed to that 
location. 


REF — Auto refresh is used during normal opera- 
tion and is analogous to the CAS-before-RAS 
(CBR) refresh in conventional DRAMs.During auto 
refresh the address bits are "don’t care". The 
memory controller precharges all banks prior to an 
auto refresh cycle. Auto refresh cycles are issued 
approximately 15us apart. 


The self refresh command is used to retain data in 
the SDRAMs even when the rest of the system is 
powered down. The self refresh command is 
similar to an auto refresh command except CKE is 
disabled (low). The memory controller issues a self 
refresh command during 3V Suspend mode when 
all the internal clocks are stopped. 


4.3.3.1 SDRAM Initialization Sequence 
After the clocks have started and stabilized, the 
memory controller SDRAM initialization sequence 
begins: 


1) Precharge all component banks, 
2) perform eight refresh cycles, 
3) followed by an MRS cycle, 

4) followed by eight refresh cycles. 


This sequence is compatible with the majority of 
SDRAMs available from the various vendors. 
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4.3.4 Memory Controller Register Table 4-15 summarizes the 32-bit registers 
Description contained in the memory controller. Table 4-16 


The Memory Controller maps 100h locations gives detalceiTeg islet DiILNonmals. 


starting at GX_BASE+8400h. Refer to Section 
4.1.2 “Control Registers” on page 106 for instruc- 
tions on accessing these registers. 


Table 4-15 Memory Controller Register Summary 


GX_BASE+ 
Memory Offset Type | Name/Function Default Value 
8400h-8403h R/W MC_MEM_CNTRL1 248C0040h 


Memory Controller Control Register 1 — Memory controller configuration infor- 
mation e.g., refresh interval, SDCLK ratio, etc. 


8404h-8407h R/W MC_MEM_CNTRL2 00000801h 


Memory Controller Control Register 2 — Memory controller configuration infor- 
mation to control SDCLK. 

8408h-840Bh R/W MC_BANK_CFG 41104110h 
Memory Controller Bank Configuration — Contains the configuration informa- 
tion for the each of the two DIMMs in the memory array. BIOS programs this 
register during boot by running an autosizing routine on the memory. 
840Ch-840Fh R/W MC_SYNC_TIM1 2A733225h 
Memory Controller Synchronous Timing Register 1 — SDRAM memory timing 
information - This register controls the memory timing of all four banks of 
DRAM. BIOS programs this register based on the processor frequency and the 
SDCLK divide ratio. 

8414h-8417h R/W MC_GBASE_ADD 00000000h 
Memory Controller Graphics Base Address Register — This register sets the 
graphics memory base address, which is programmable on 512KB boundaries. 
The display controller and the graphics pipeline generate a 20-bit DWORD off- 
set that is added to the graphics memory base address to form the physical 
memory address. Typically, the graphics memory region is located at the top of 
physical memory. 


8418h-841Bh R/W MC_DR_ADD 00000000h 


Memory Controller Dirty RAM Address Register — This register is used to set 
the Dirty RAM address index for processor diagnostic access. This register 
should be initialized before accessing the MC_DR_ACC register 


841Ch-841Fh R/W MC_DR_ACC 0000000xh 


Memory Controller Dirty RAM Access Register — This register is used to 
access the Dirty RAM. A read/write to this register will access the Dirty RAM at 
the address specified in the MC_DR_ADD register. 
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Table 4-16 Memory Controller Registers 


Bit Name Description 
GX_BAS+ 8400h-8403h MC_MEM_CNTRL1 (R/W) Default Value = 248C0040h 
31:29 MDHDCTL MD High Drive Control: Controls the high drive and slew rate of the memory data bus (MD[63:0)): 
000 = Tristate 
001 = Smallest drive strength 
010-110 = Represents gradual drive strength increase 
111 = Highest drive strength 
28:26 MABAHDCTL | MA/BA High Drive Control: Controls the high drive and slew rate of the memory address bus 
including the memory bank address bus (MA[12:0] and BA[1:0]): 
000 = Tristate 
001 = Smallest drive strength 
010-110 = Represents gradual drive strength increase 
111 = Highest drive strength 
25:23 MEMHDCTL _| Control High Drive/Slew Control: Controls the high drive and slew rate of the memory control sig- 
nals (CASA#, CASB#, RASA#, RASB#, CKEA, CKEB, WEA#, WEA#, DQM[7:0], and CS[3:0]#): 
000 = Tristate 
001 = Smallest drive strength 
010-110 = Represents gradual drive strength increase 
111 = Highest drive strength 
22 RSVD Reserved: Set to 0. 
21 RSVD Reserved: Must be set to 0. Wait state on the X-Bus x_data during read cycles - for debug only. 
20:18 SDCLKRATE | SDRAM Clock Ratio: Selects SDRAM clock ratio: 
000 = Reserved 100 =+3.5 
001=+2 101=+4 
010 =+2.5 110=+4.5 
011 = + 3 (Default) 111=+5 
Ratio does not take effect until the SDCLKSTRT bit (bit 17 of this register) transitions from 0 to 1. 
17 SDCLKSTRT | Start SDCLK: Start operating SDCLK using the new ratio and shift value (selected in bits [20:18] of 
this register): 0 = Clear; 1 = Enable. 
This bit should be cleared every time before a one is written to it in order to start SDCLK or to 
change the shift value. 

16:8 RFSHRATE Refresh Interval: This field determines the number of processor core clocks multiplied by 64 
between refresh cycles to the DRAM. By default, the Refresh Interval is OOh. This implies that 
refresh is turned off by default. 

7:6 RFSHSTAG Refresh Staggering: This field determines number of clocks between REF commands to different 
banks during refresh cycles: 

00 = 0 SDRAM clocks 10 = 2 SDRAM clocks 
01 = 1 SDRAM clocks (Default) 11 =4 SDRAM clocks 
Staggering is used to help reduce power spikes during refresh. When only DIMM60O is installed and it 
has only one DIMM bank, then this field must be set to 00. 
5 2CLKADDR Two Clock Address Setup: Assert memory address for one extra clock before CS# is asserted: 
0 = Disable; 1 = Enable. 
This can be used to compensate for address setup at high frequencies. 
4 RFSHTST Test Refresh: This bit, when set high, generates a refresh request. This bit is only used for testing 


purposes. 
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Table 4-16 Memory Controller Registers (cont.) 


Bit Name Description 

3 XBUSARB X-Bus Round Robin: When enabled, processor requests are arbitrated at the same priority level 
than graphics pipeline requests and non-critical display controller requests. When disabled, proces- 
sor requests are arbitrated at a higher priority level. High priority display controller requests always 
have the highest arbitration priority: 0 = Enable; 1 = Disable. 

2 VGAWRP VGA Wrap Enable: Allow memory wrapping into the VGA memory address space from A0000h to 
BFFFFh: 0 = Disable; 1 = Enable. 

1 RSVD Reserved: Set to 0. 

0 SDRAMPRG_| Program SDRAM: When this bit is set the memory controller will program the SDRAM MRS register 


using LTMODE in MC_SYNC_TIM1. 
This bit should be cleared every time before a one is written to it in order to program the SDRAM. 


GX_BASE+8404h-8407h 


MC_MEM_CNTRL2 (R/W) Default Value = 00000801h 


31:18 RSVD Reserved: Set to 0. 
17:16 SDCLKRISE | SDCLK Rising Delay: Controls the delay between the core clock and the rising edge of SDCLK dur- 
ing all modes. (Set by BIOS.) 
15:14 SDCLKFALL | SDCLK Falling Delay: Controls the delay between the core clock and the falling edge of SDCLK 
during 2.5 and 3.5 clock modes. (Set by BIOS.) 
13:11 SDCLKHDCTL | SDCLK High Drive/Slew Control: Controls the high drive and slew rate of SDCLK[3:0] and 
SDCLK_OUT. 
000 = Highest drive strength. (No braking applied in the pads) 
001 = Smallest drive strength 
010-110 = Represent gradual drive strength increase 
111 = Highest drive strength 
10 SDCLKOMSK_ | Mask SDCLK_OUT: 0 = Not masked; 1 = Mask. 
9 SDCLK3MSK_ | Mask SDCLK3: 0 = Not masked; 1 = Mask. 
8 SDCLK2MSK_ | Mask SDCLK2: 0 = Not masked; 1 = Mask 
7 SDCLK1MSK_ | Mask SDCLK1: 0 = Not masked; 1 = Mask. 
6 SDCLKOMSK_ | Mask SDCLKO: 0 = Not masked; 1 = Mask 
5:3 SHFTSDCLK | Shift SDCLK: This function allows shifting SDCLK to meet SDRAM setup and hold time require- 
ments. The shift function will not take effect until the SDCLKSTRT bit (bit 17 of MC_MEM_CNTRL1) 
transitions from 0 to 1: 
000 = No shift 100 = Shift 2 core clocks 
001 = Shift 0.5 core clock 101 = Shift 2.5 core clocks 
010 = Shift 1 core clock 110 = Shift 3 core clocks 
011 = Shift 1.5 core clock 111 = Reserved 
Note: Refer to Figure 4-10 for an example of SDCLK shifting. 
2 RSVD Reserved: Set to 0. 
1 RD Read Data Phase: Selects if read data is latched one or two core clock after the rising edge of 
SDCLK: 0 = 1 core clock; 1 = 2 core clocks. 
0 FSTRDMSK Fast Read Mask: Do not allow core reads to bypass the request FIFO: 0 = Disable; 1 = Enable. 
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Table 4-16 Memory Controller Registers (cont.) 


Bit Name Description 
GX_BASE+8408h-840Bh MC_BANK_CFG (R/W) Default Value = 41104110h 
31 RSVD Reserved: Set to 0. 
30 DIMM1_ DIMM1 Module Banks: Selects the number of module banks per DIMM for DIMM1: 
MOD_BNK | 0 = 1 Module bank 
1 = 2 Module banks 
29 RSVD Reserved: Set to 0. 
28 DIMM1_ DIMM1 Component Banks: Selects the number of component banks per module bank for DIMM1: 
COMP_BNK | 0 = 2 Component banks 
1 = 4 Component banks 
27 RSVD Reserved: Set to 0. 
26:24 DIMM1_SZ DIMM1 Size: Selects the size of DIMM1: 
000 = 4MB 010 = 16MB 100 = 64MB 110 = 256MB 
001 = 8MB 011 = 32MB 101 = 128MB 111 = 512MB 
23 RSVD Reserved: Set to 0. 
22:20 DIMM1_PG_SZ | DIMM1 Page Size — Selects the page size of DIMM1: 
000 = 1KB 010 = 4KB 1xx = 16KB 
001 = 2KB 011 = 8KB 111 = DIMM1 not installed 
When DIMM1 is not installed, program all other DIMM1 fields to 0. 
19:15 RSVD Reserved: Set to 0. 
14 DIMMO_ DIMMO Module Banks: Selects number of module banks per DIMM for DIMMO: 
MOD_BNK | 0 = 1 Module bank 
1 = 2 Module banks 
13 RSVD Reserved — Set to 0. 
12 DIMMO_ DIMMO Component Banks: Selects the number of component banks per module bank for DIMMO: 
COMP_BNK | 0 = 2 Component banks 
1 = 4 Component banks 
11 RSVD Reserved: Set to 0. 
10:8 DIMMO_SZ DIMMO Size: Selects the size of DIMM1: 
000 = 4MB 010 = 16MB 100 = 64MB 110 = 256MB 
001 = 8MB 011 = 32MB 101 = 128MB 111 = 512MB 
7 RSVD Reserved: Set to 0. 
6:4 DIMMO_PG_SZ | DIMMO Page Size: Selects the page size of DIMMO: 
000 = 1KB 010 = 4KB 1xx = 16KB 
001 = 2KB 011 = 8KB 111 = DIMMO not installed 
When DIMM0 is not installed, program all other DIMM0 fields to 0. 
3:0 RSVD Reserved: Set to 0. 
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Table 4-16 Memory Controller Registers (cont.) 


Bit Name Description 
GX_BASE+840Ch-840Fh MC_SYNC_TIM1 (R/W) Default Value = 2A733225h 
31 RSVD Reserved: Set to 0. 
30:28 LTMODE CAS Latency (LTMODE): CAS latency is the delay, in clock cycles, between the registration of a 


read command and the availability of the first piece of output data (BIOS interrogates EEPROM 
across the I?C interface to determine this value): 


000 = Reserved 010 =2 CLK 100 = 4 CLK 110 =6 CLK 
001 = 1 CLK 011 =3 CLK 101 =5 CLK 111 =7 CLK 


This field will not take effect until SDRAMPRG (bit 0 of MC_MEM_CNTRL1) transitions from 0 to 1. 
ERRATA: CAS Latency of 1 CLK is not currently supported. 


27:24 RC REF to REF/ACT Command Period (tRC): Minimum number of SDRAM clock between REF and 
REF/ACT commands: 
0000 = Reserved 0100=5 CLK 1000 =9 CLK 1100 = 13 CLK 
0001 = 2 CLK 0101 =6 CLK 1001 = 10 CLK 1101 = 14 CLK 
0010 = 3 CLK 0110 = 7 CLK 1010 = 11 CLK 1110 = 15 CLK 
0011 = 4 CLK 0111 = 8 CLK 1011 = 12 CLK 1111 = 16 CLK 
23:20 RAS ACT to PRE Command Period (tRAS): Minimum number of SDRAM clocks between ACT and 
PRE commands: 
0000 = Reserved 0100=5 CLK 1000 = 9 CLK 1100 = 13 CLK 
0001 = 2 CLK 0101 =6 CLK 1001 = 10 CLK 1101 =14 CLK 
0010 = 3 CLK 0110 = 7 CLK 1010 = 11 CLK 1110 = 15 CLK 
0011 = 4 CLK 0111 = 8 CLK 1011 = 12 CLK 1111 = 16 CLK 
19 RSVD Reserved — Set to 0. 
18:16 RP PRE to ACT Command Period (tRP): Minimum number of SDRAM clocks between PRE and ACT 
commands: 
000 = Reserved 010 =2 CLK 100 = 4 CLK 110 =6 CLK 
001 =1 CLK 011 =3 CLK 101 =5 CLK 111 =7 CLK 
15 RSVD Reserved — Set to 0. 
14:12 RCD Delay Time ACT to READ/WRITE Command (tRCD): Minimum number of SDRAM clock between 
ACT and READ/WRITE commands: 
000 = Reserved 010 =2 CLK 100 = 4 CLK 110 =6 CLK 
001 = 1 CLK 011 =3 CLK 101 =5 CLK 111 =7 CLK 
11 RSVD Reserved: Set to 0. 
10:8 RRD ACT(0) to ACT(1) Command Period (tRRD): Minimum number of SDRAM clocks between ACT 


and ACT command to two different component banks within the same module bank. The memory 
controller does not perform back-to-back Activate commands to two different component banks with- 
out a READ or WIRTE command between them. Hence, this field should be set to 001. 


7 RSVD Reserved: Set to 0. 
6:4 DPL Data-in to PRE command period (tDPL): Minimum number of SDRAM clocks from the time the 
last write datum is sampled till the bank is precharged: 
000 = Reserved 010 =2 CLK 100 =4 CLK 110 =6 CLK 
001 = 1 CLK 011 =3 CLK 101 =5 CLK 111 =7 CLK 
3:0 RSVD Reserved: Set to 0 or leave unchanged. 
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Table 4-16 Memory Controller Registers (cont.) 


Bit Name Description 
GX_BASE+8414h-8417h MC_GBASE_ADD (R/W) Default Value = 00000000h 
31:18 RSVD Reserved: Set to 0. 
17 TE Test Enable TEST[3:0]: 
0 = TEST[3:0] are driven low 
1 = TEST[3:0] pins are used to output test information 
16 TECTL Test Enable Shared Control Pins: 
0 = RASB#, CASB#, CKEB, WEB# are driven low 
1 = RASB#, CASB#, CKEB, WEB# are used to output test information 
15:12 SEL Select: This field is used for debug purposes only. 
11 RSVD Reserved: Set to 0. 
10:0 GBADD Graphics Base Address: This field indicates the graphics memory base address, which is pro- 
grammable on 512KB boundaries. This field corresponds to address bits [29:19]. 
Note that BC_DRAM_TOP must be set to a value lower than the Graphics Base Address. 


GX_BASE+8418h-841Bh 


MC_DR_ADD (R/W) Default Value = 00000000h 


31:10 


RSVD 


Reserved: Set to 0. 


9:0 


DRADD 


Dirty RAM Address: This field is the address index that is used to access the Dirty RAM with the 
MC_DR_ACC register. This field does not auto increment. 


GX_BASE+841Ch-841Fh 


MC_DR_ACC (R/W) Default Value = 0000000xh 


31:2 RSVD Reserved: Set to 0. 
1 D Dirty Bit: This bit is read/write accessible. 
0 Vv Valid Bit: This bit is read/write accessible. 


Page 126 


Cyrix Corporation Confidential GXm_db_v2.0 


Memory Controller A | 


4.3.5 Address Translation 


The memory controller supports two address trans- 
lations depending on the method used to interleave 
pages. 


4.3.5.1 High Order Interleaving 


High Order Interleaving (HOI) uses the most signif- 
icant address bits to select which bank the page is 
located in. This has the affect of allowing any 
mixture of DIMM types. However, it soreads the 
pages over wide address ranges. For example, two 
8MB DIMMs contain a total of four component 
pages. Two pages are together in one DIMM sepa- 
rated from the other two pages by 8MB. 


4.3.5.2 Low Order Interleaving 


Low Order Interleaving (LOI) uses the least signifi- 
cant bits after the page bits to select which bank 
the page is located in. This requires that memory is 
a power of 2, that the number of banks is a power 
of 2, and that the page sizes are the same. In other 
words, the DIMMs have to be of the same type. 
However, LOI does give a good benefit by 
providing a moving page throughout memory. 
Using the same example as above, two banks 
would be on one DIMM and the next two banks 
would be on the second DIMM, but they would be 
linear in address space. For an eight bank system 


that has 1KB address (8KB data) pages, there 
would be an effective moving page of 64KB of 
data. 


4.3.5.3 Physical Address to DRAM 
Address Conversion 


Auto LOI is in effect whenever the two DIMMs have 
the same number of DIMM banks, component 
banks, module sizes and page sizes. 


Tables 4-17 and 4-18 give Auto LOI address 
conversion examples when two DIMMs of the 
same size are used in a system. Table 4-17 shows 
a one DIMM bank conversion example, while Table 
4-18 shows a two DIMM bank example. 


Tables 4-19 and 4-20 give Non-Auto LOI address 
conversion examples when either one or two 
DIMMs of different sizes are used in a system. 
Table 4-19 shows a one DIMM bank address 
conversion example, while Table 4-20 shows a two 
DIMM bank example. The addresses are 
computed on a per DIMM basis. 


Since the DRAM interface is 64 bits wide, the lower 
three bits of the physical address get mapped onto 
the DQM[7:0] lines. Thus, the address conversion 
tables (Tables 4-17 through 4-20) show the phys- 
ical address starting from A3. 
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Table 4-17 Auto LOI -- 2 DIMMs, Same Size, 1 DIMM Bank 


1K Page Size 2K Page Size 4K Page Size 1K Page Size 2K Page Size 4K Page Size 
Row Col Row Col Row Col Row Col Row Col Row Col 
Address 2 Component Banks 4 Component Banks 
MA12 A24 -- A25 -- A26 A25 -- A26 -- A27 
MA11 A23 -- A24 -- A25 A24 -- A25 -- A26 
MA10 A22 -- A23 -- A24 A23 -- A24 -- A25 
MAQ A21 -- A22 -- A23 A22 AQ A23 -- A24 
MA8 A20 -- A21 -- A22 Alt A2t A8 A22 -- A23 Alt 
MA7 A19 -- A20 A10 A2t A10 A20 A7 A2t A10 A22 A10 
MA6 A18 AQ A19 AQ A20 AQ A19 A6 A20 AQ A21 AQ 
MA5 A17 A8 A18 A8 A19 A8 A18 A5 A19 A8 A20 A8 
MA4 A16 A7 A17 A7 A18 A7 A17 A4 A18 A7 A19 A7 
MA3 A15 A6 A16 A6 A17 A6 A16 A3 A17 A6 A18 A6 
MA2 A14 A5 A15 A5 A16 A5 A15 A8 A16 A5 A17 A5 
MA1 A13 A4 A14 A4 A15 A4 A14 A7 A15 A4 A16 A4 
MAO A12 A3 A13 A3 A14 A3 A13 A6 A14 A3 A15 A3 
CS0/CS1 All A12 A13 A12 A13 A14 
CS2/CS3 #2 * a - - a 
BAO/BA1 A10 Alt A12 A11/A10 A12/A11 A13/A12 


Table 4-18 Auto LOI -- 2 DIMMs, Same Size, 2 DIMM Banks 


1K Page Size 2K Page Size 4K Page Size 1K Page Size 2K Page Size 4K Page Size 
Row Col Row Col Row Col Row Col Row Col Row Col 
Address 2 Component Banks 4 Component Banks 
MA12 A25 -- A26 -- A27 A26 -- A27 -- A28 -- 
MA11 A24 -- A25 -- A26 A25 -- A26 -- A27 -- 
MA10 A23 -- A24 -- A25 A24 -- A25 -- A26 -- 
MAQ A22 -- A23 -- A24 A23 -- A24 -- A25 -- 
MA8 A21 -- A22 -- A23 Alt A22 -- A23 -- A24 Alt 
MA7 A20 -- A21 A10 A22 A10 A21 -- A22 A10 A23 A10 
MA6 A19 AQ A20 AQ A21 AQ A20 AQ A21 AQ A22 AQ 
MAS5 A18 A8 A19 A8 A20 A8 A19 A8 A20 A8 A21 A8 
MA4 A17 A7 A18 A7 A19 A7 A18& A7 A19 A7 A20 A7 
MA3 A16 A6 A17 A6 A18 A6 A17 A6 A18 A6 A19 A6 
MA2 A15 A5 A16 A5 A17 A5 A16 A5 A17 A5 A18 AS 
MA1 A14 A4 A15 A4 A16 A4 A15 A4 A16 A4 A17 A4 
MAO A13 A3 A14 A3 A15 A3 A14 A3 A15 A3 A16 A3 
CS0/CS1 Al2 A13 A14 A13 A14 A15 
CS2/CS3 Alt Al2 A13 A12 A13 A14 
BAO/BA1 A10 Alt Al2 A11/A10 A12/A11 A13/A12 
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Table 4-19 Non-Auto LOI -- 1 or 2 DIMMs, Different Sizes, 1 DIMM Bank 


1K Page Size 2K Page Size 4K Page Size 1K Page Size 2K Page Size 4K Page Size 
Row Col Row Col Row Col Row Col Row Col Row Col 
Address 2 Component Banks 4 Component Banks 
MA12 A23 -- A24 -- A25 -- A24 -- A25 -- A26 
MA11 A22 -- A23 -- A24 -- A23 -- A24 -- A25 
MA10 A21 -- A22 -- A23 -- A22 -- A23 -- A24 
MAQ A20 -- A21 -- A22 -- A21 -- A22 -- A23 
MA8 A19 -- A20 -- A21 Alt A20 -- A21 -- A22 Alt 
MA7 A18 -- A19 A10 A20 A10 A19 -- A20 A10 A2t A10 
MA6 A17 AQ A18& AQ A19 AQ A18 AQ A19 AQ A20 AQ 
MA5 A16 A8 A17 A8 A18 A8 A17 A8 A18 A8 A19 A8 
MA4 A15 A7 A16 A7 A17 A7 A16 A7 A17 A7 A18 A7 
MA3 A14 A6 A15 A6 A16 A6 A15 A6 A16 A6 A17 A6 
MA2 A13 AS A14 A5 A15 A5 A14 A5 A15 AS A16 AS 
MA1 Al2 A4 A13 A4 A14 A4 A13 A4 A14 A4 A15 A4 
MAO Alt A3 A12 A3 A13 A3 A12 A3 A13 A3 A14 A3 
CS0/CS1 - = 7 =n a5 SS 
CS2/CS3 - 7 ca = a = 
BAO/BA1 A10 Alt Al2 A11/A10 A12/A11 A13/A12 


Table 4-20 Non-Auto LOI -- 1 or 2 DIMMs, Different Sizes, 2 DIMM Banks 


1K Page Size 2K Page Size 4K Page Size 1K Page Size 2K Page Size 4K Page Size 
Row Col Row Col Row Col Row Col Row Col Row Col 
Address 2 Component Banks 4 Component Banks 
MA12 A24 -- A25 -- A26 -- A25 -- A26 -- A27 -- 
MA11 A23 -- A24 -- A25 -- A24 -- A25 -- A26 -- 
MA10 A22 -- A23 -- A24 -- A23 -- A24 -- A25 -- 
MAQ A21 -- A22 -- A23 -- A22 -- A23 -- A24 -- 
MA8 A20 -- A21 -- A22 Alt A21 -- A22 -- A23 Alt 
MA7 A19 -- A20 A10 A21 A10 A20 -- A21 A10 A22 A10 
MA6 A18 AQ A19 AQ A20 AQ A19 AQ A20 AQ A21 AQ 
MA5 A17 A8 A18 A8 A19 A8 A18 A8 A19 A8 A20 A8 
MA4 A16 A7 A17 A7 A18 A7 A17 A7 A18 A7 A19 A7 
MA3 A15 A6 A16 A6 A17 A6 A16 A6 A17 A6 A18 A6 
MA2 A14 AS A15 A5 A16 A5 A15 A5 A16 A5 A17 AS 
MA1 A13 A4 A14 A4 A15 A4 A14 A4 A15 A4 A16 A4 
MAO Al2 A3 A13 A3 A14 A3 A13 A3 A14 A3 A15 A3 
CS0/CS1 A111 Al2 A13 Al2 A13 A14 
CS2/CS3 - = 
BAO/BA1 A10 Alt A12 A11/A10 A12/A11 A13/A12 
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4.3.6 Memory Cycles SDRAM Read Cycle 

Figures 4-5 through 4-8 illustrate various memory Figure 4-5 shows a SDRAM read cycle. The figure 
cycles that the memory controller supports. The assumes that a previous Activate command has 
following subsections describe some of the presented the row address for the read operation. 
supported cycles. Note that the burst length for the READ command 


is always two. 


col > 
DQM | \ / | 
Mo ex XD 


Figure 4-5 Basic Read Cycle with a CAS Latency of Two 
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SDRAM Write Cycle 
Figure 4-6 shows a SDRAM write cycle. The burst length for the WRITE command is 2. 


SDCLK | | | | | | | | | | 
| ee 


WE# 


a. OS )=3— 


vo | X01 | Qa 
com 7X | > 


Figure 4-6 Basic Write Cycle 
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SDRAM Refresh Cycle 


Figure 4-7 shows a SDRAM auto refresh cycle. 


The memory controller always precedes the 
refresh cycle with a Precharge command to all 
banks. 


Page Miss 

Figure 4-8 shows a Read/Write command after a 
page miss cycle. In order to program the new row 
address, a Precharge command must be issued 
followed by an Activate command. 


Figure 4-7 Auto Refresh Cycle 


souk | LP LJ LE LI LI LI Ld | 
COMMAND PRE X NOP XNOP XACT XNOPX NOP X RMX NOP > 


~_— tRP | {RCD — 


Figure 4-8 Read/Write Command to a New Row Address 
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4.3.7 SDRAM Interface Clocking 


The MediaGX processor drives the SDCLK to the 
SDRAMs; one for each DIMM bank. All the control, 
data, and address signals driven by the memory 
controller are sampled by the SDRAM at the rising 
edge of SDCLK. SDCLKOUT is a reference signal 
used to generate SDCLKIN. Read data is sampled 
by the memory controller at the rising edge of 
SDCLKIN. 


The delay for SDCLKIN must be designed so that it 
lags the SDCLKs at the DRAM by approximately 
2ns. The delay should also include the SDCLK 
transmission line delay. The SDCLK traces on the 
board need to be laid out so there is no skew 
between each of the four sinks. These guidelines 
allow the memory interface to be closer to the 
DRAM specifications. They improve performance 
by running the SDCLK up to frequencies of 
100MHz and a CAS latency of two. 


SDCLK[3:0] 


SDCLKOUT 


MediaGX™ 
MMX™-Enhanced 
Processor 


SDCLKO 


DIMM 
SDCLK1 0 


SDCLK2 


SDCLK3 


Figure 4-9 SDCLKIN Clocking 
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The SDRAM interface timings are programmable. plied up. The memory controller runs off this 

The SHFTSDCLK bits in the MC_MEM_CNTRL2 processor clock. The memory clock is generated 
register can be used to change the relationship by dividing down the processor clock. SDCLK is 
between SDCLK and the control/address/data generated from the memory clock. In the example 
signals. To meet setup and hold time requirements diagram, the processor clock is running 6X times 
for SDRAM across different board layouts, the the PCI clock and the memory clock is running in 
SHFTSDCLK bits are used. SHFTSDCLK bit divide by 3 mode. 


values are selected based upon the SDRAM 
signals loads and the core frequency (refer to Table © 'he SDRAM control, address, and data signals are 


7-10 in Section 7.6 “AC Characteristics”). driven off edge "x" of the memory clock to be setup 
before edge "y". With no shift applied, the control 


Figure 4-10 shows an example of how the SHFTS- signals could end up being latched on edge "x". A 
DCLK bits setting effects SDCLK. The PCI clock is shift value of two or three could be used so that 
the input clock to the MediaGX processor. The core SDCLK at the SDRAM is centered around when 


clock is the internal processor clock that is multi- the control signals change. 
| | | | | | | | 
| | | 
PCI Clock | | | | | 
| | | | | | | | 
Core Clock 
(Internal) 0 1 2 3 4 5 6 
Memory 
Clock | | . | | y | 
(Internal) 


Shift= 4 3 2 1 0 


Note: The first SDCLK shows how SDCLK operates with the SHFTSDCLK bits = 000, no shift. 
The second SDCLK shows how SDCLK operates with the SHFTSDCLK bits = 001, shift 0.5 core clock. 
(See MC_MEMCNTRL2 bits [5:3], Table 4-16 on page 123, for remaining decode values.) 


Figure 4-10 Effects of SHFTSDCLK Programming Bits Example 
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4.4 Graphics Pipeline 4.4.1 BitBLT/Vector Engine 

The graphics pipeline of the MediaGX MMX- BLTs are initiated by writing to the GP_BLT_MODE 
Enhanced processor includes a BitBLT/vector register, which specifies the type of source data 
engine which has been optimized for Microsoft® (none, frame buffer, or BLT buffer), the type of the 
Windows®. The hardware supports pattern genera- _— destination data (none, frame buffer, or BLT buffer), 
tion, source expansion, pattern/source transpar- and a source expansion flag. 


ency, and 256 ternary raster operations. The block 
diagram of the graphics pipeline is shown in Figure 


4-11. 
Scratchpad RAM 
and 
BitBLT Buffers 
C-Bus 
eS SS SS SS SS SS SS SS aS 4 
| Graphics | 
| Output Aligner Output Aligner Pipeline | 
| | 
| | 
| Pattern Source | 
| Hardware Expansion | 
Ganitol Leal Internal Bus 
| BE PAT BE SRC DST Osos Interface Unit 
| v | 
| Raster Operation | 
| | 
| | 
; DRAM Interface Healstel eccss | 
Ds eit A es Se pel Be oe ee oh ey i 
X-Bus 
Key: 


BE = Byte Enable 

PAT = Pattern Data 
SRC = Source Data 
DST = Destination Data 


Memory 
Controller 


Figure 4-11 Graphics Pipeline Block Diagram 
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The BLT buffers in the dedicated cache temporarily 
store source and destination data, typically on a 
scan line basis. The hardware automatically loads 
frame-buffer data (source or destination) into the 
BLT buffers for each scan line. The software is 
responsible to make sure that this does not over- 
flow the memory allocated for the BLT buffers. 
When the source data is a bitmap, the data is 
loaded directly into the BLT buffer before starting 
the BLT. 


Vectors are initiated by writing to the 
GP_VECTOR_MODE register 
(GX_BASE+8204h), which specifies the direction 
of the vector and a “read destination data” flag. If 
the flag is set, the hardware will read destination 
data along the vector and store it temporarily in 
BLT Buffer 0. 


4.4.2 Master/Slave Registers 


When starting a BitBLT or vector operation, the 
graphics pipeline registers are latched from the 
master registers to the slave registers. A second 
BitBLT or vector operation can then be loaded into 
the master registers while the first operation is 
rendered. If a second BLT is pending in the master 


Table 4-21 Graphics Pipeline Registers 


registers, any write operations to the graphics pipe- 
line registers will corrupt the values of the pending 
BLT. Software must prevent this from happening by 
checking the “BLT Pending” bit in the 
GP_BLT_STATUS register (GX_BASE+820Ch[2]. 


Most of the graphics pipeline registers are latched 
directly from the master registers to the slave regis- 
ters when starting a new BitBLT or vector opera- 
tion. Some registers, however, use the updated 
slave values if the master registers have not been 
written, which allows software to render successive 
primitives without loading some of the registers as 
outlined in Table 4-21. 


4.4.3 Pattern Generation 


The graphics pipeline contains hardware support 
for 8x8 monochrome patterns (expanded to two 
colors), 8x8 dither patterns (expanded to four 
colors), and 8x1 color patterns. The pattern hard- 
ware, however, does not maintain a pattern origin, 
so the pattern data must be justified before it is 
loaded into the MediaGX processor’s registers. For 
solid primitives, the pattern hardware is disabled 
and the pattern color is always sourced from the 
GP_PAT_COLOR_0 register (GX_BASE+8110h). 


Master Function 


GP_DST_XCOOR Next X position along vector. 


Master register if written, otherwise: 


Unchanged slave if BLT, source mode = bitmap. 
Slave + width if BLT, source mode = text glyph 


GP_DST_YCOOR Next Y position along vector. 


Master register if written, otherwise: 
Slave +/- height if BLT, source mode = bitmap. 
Unchanged slave if BLT, source mode = text glyph. 


GP_INIT_ERROR Master register if written, otherwise: 


Initial error for the next pixel along the vector. 


GP_SRC_YCOOR Master register if written, otherwise: 


Slave +/- height if BLT, source mode = bitmap. 
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4.4.3.1. Monochrome Patterns 


Monochrome patterns are selected by setting the 
pattern mode to 01b in the GP_RASTER_MODE 
register (GX_BASE+ 8200h). Those pixels corre- 
sponding to a clear bit (0) in the pattern are 
rendered using the color specified in the 
GP_PAT_COLOR_0 register, and those pixels 
corresponding to a set bit (1) in the pattern are 
rendered using the color specified in the 
GP_PAT_COLOR_1 register (GX_BASE+8112h). 


If the pattern transparency bit is set high in the 
GP_RASTER_MODE register, those pixels corre- 
sponding to a clear bit in the pattern data are not 
drawn. 


Monochrome patterns use bits [63:0] of the pattern 
data. Bits [7:0] correspond to the first row of the 
pattern, and bit 7 corresponds to the leftmost pixel 
on the screen. This is illustrated Figure 4-12. 


GP_PAT_DATA_0 = 0x80412214 


GP_PAT_DATA_1 = 0x08142241 


14 


22 


41 
80 
41 


22 


14 
08 


4.4.3.2 Dither Patterns 


Dither patterns are selected by setting the pattern 
mode to 10b in the GP_RASTER_MODE register 
(Table 4-25). Two bits of pattern data are used for 
each pixel, allowing color expansion to four colors. 
The colors are specified in the GP_PAT_COLOR_0 
through GP_PAT_COLOR_3 registers (Table 4- 
25). 


Dither patterns use all 128 bits of pattern data. Bits 
[15:0] correspond to the first row of the pattern (the 
lower byte contains the LSB of the pattern color 
and the upper byte contains the MSB of the pattern 
color). This is illustrated in Figure 4-13. 


GP_PAT_DATA_0 = 0x441100AA 


GP_PAT_DATA_1 = 0x115500AA 
I GP_PAT_DATA_2 = 0x441100AA 
BB Gp_PAT_DATA_3 = 0x115500AA 


OO0AA 


4411 fe! eal 


OOAA 


1155 al a 


OOAA 


4411 [] [| 


OOAA 


1155 a a 


Figure 4-12 Example of Monochrome Patterns 


Figure 4-13 Example of Dither Patterns 
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4.4.3.3. Color Patterns 


Color patterns are selected by setting the pattern 
mode to 11b in the GP_RASTER_MODE register. 
Bits [63:0] are used to hold a row of pattern data for 
an 8-BPP pattern, with bits [7:0] corresponding to 
the leftmost pixel of the row. Likewise, bits [127:0] 
are used for a 16-BPP color pattern, with bits [15:0] 
corresponding to the leftmost pixel of the row. 


To support an 8x8 color pattern, software must load 
the pattern data for each row. 


4.4.4 Source Expansion 


The graphics pipeline contains hardware support 
for color expansion of source data (primarily used 
for text). Those pixels corresponding to a clear bit 
(0) in the source data are rendered using the color 
specified in the GP_SRC_COLOR_0 register 
(GX_BASE+810Ch), and those pixels corre- 
sponding to a set bit (1) in the source data are 
rendered using the color specified in the 
GP_SRC_COLOR_1 register (GX_BASE+810Eh). 


If the source transparency bit is set in the 
GP_RASTER_MODE register, those pixels corre- 
sponding to a clear bit (0) in the source data are 
not drawn. 


4.4.5 Raster Operations 


The GP_RASTER_MODE register specifies how 
the pattern data, source data (color-expanded if 
necessary), and destination data are combined to 
produce the output from the graphics pipeline. The 
definition of the ROP value matches that of the 
Microsoft® API. This allows Windows® display 
drivers to load the raster operation directly into 
hardware. Table 4-22 illustrates this definition. 


Graphics Pipeline 


Some common raster operations are described in 
Table 4-23. 


Table 4-22 GP_RASTER_MODE Bit Patterns 


Pattern Source | Destination Output 
(bit) (bit) (bit) (bit) 
0 0) 0) ROP(0] 
0 0 1 ROP{1] 
0 1 0 ROP[2] 
0 1 1 ROP[3] 
1 0 0 ROP[4] 
1 0 1 ROP[5] 
1 1 0 ROP{[6] 
1 1 1 ROP{7] 


Table 4-23 Common Raster Operations 


ROP Description 

FOh Output = Pattern 

CCh Output = source 

5Ah Output = Pattern xor destination 
66h Output = Source xor destination 
55h Output = ~Destination 
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4.4.6 Graphics Pipeline Register “Control Registers” on page 106 for instructions on 
Descriptions accessing these registers. 

The graphics pipeline maps 200h locations starting Table 4-24 summarizes the graphics pipeline regis- 

at GX_BASE+8100h. Refer to Section 4.1.2 ters and Table 4-25 gives detailed register/bit 


Table 4-24 Graphics Pipeline Configuration Register Summary 


GX_BASE+ 
Memory Offset Type | Name / Function Default Value 
8100h-8103h R/W GP_DST/START_Y/XCOOR 00000000h 


Destination/Starting Y and X Coordinates Register — In BLT mode this register 
specifies the destination Y and X positions for a BLT operation. In Vector mode 
it specifies the starting Y and X positions in a vector. 

8104-8107h R/W GP_WIDTH/HEIGHT and GP_VECTOR_LENGTH/INIT_ERROR 00000000h 
Width/Height or Vector Length/Initial Error Register — In BLT mode this register 
specifies the BLT width and height in pixels. In Vector mode it specifies the vec- 
tor initial error and pixel length. 

8108h-810Bh R/W GP_SRC_X/YCOOR and GP_AXIAL/DIAG_ERROR 00000000h 
Source X/Y Coordinate Axial/Diagonal Error Register — In BLT mode this regis- 
ter specifies the BLT X and Y source. In Vector mode it specifies the axial and 
diagonal error for rendering a vector. 

810Ch-810Fh R/W GP_SRC_COLOR 00000000h 


Source Color Register — Determines the colors used when expanding mono- 
chrome source data in either the 8-BPP mode or the 16-BPP mode. 


8110h-8113h R/W GP_PAT_COLOR_A (8110h) and GP_PAT_COLOR_B (8114h) 00000000h 

8114h-8117h R/W Pattern Color A and B Registers — These two registers determine the colors 00000000h 
used when expanding pattern data. 

8120h-8123h R/W GP_PAT_DATA 0 through 3 00000000h 

8124h-8127h R/W Graphics Pipeline Pattern Data Registers 0 through 3 — Together these regis- 00000000h 

8128h-812Bh RW ters contain 128 bits of pattern data. 00000000h 


GP_PAT_DATA_0 corresponds to bits [31:0] of the pattern data. 


812Ch-812Fh R/W 7 00000000h 
GP_PAT_DATA_1 corresponds to bits [63:32] of the pattern data. 
GP_PAT_DATA_2 corresponds to bits [95:64] of the pattern data. 
GP_PAT_DATA_3 corresponds to bits [127:96] of the pattern data. 
8140h-8143h R/W GP_VGA_WRITE XXXXXXXXA 
(Note) Graphics Pipeline VGA Write Patch Control Register — Controls the VGA 
memory write path in the graphics pipeline. 
8144h-8147h R/W GP_VGA_READ 00000000h 
(Note) Graphics Pipeline VGA Read Patch Control Register — Controls the VGA 
memory read path in the graphics pipeline. 
8200h-8203h R/W GP_RASTER_MODE 00000000h 
Graphics Pipeline Raster Mode Register — This register controls the manipula- 


tion of the pixel data through the graphics pipeline. Refer to Section 4.4.5 “Ras- 

ter Operations” on page 138. 

Note: The registers at GX_BASE+8140, 8144h, 8210h, and 8217h are located in the area designated for the graphics pipeline but 
are used for VGA emulation purposes. Refer to Table 5-5 on page 200 for these register’s bit formats. 
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Table 4-24 Graphics Pipeline Configuration Register Summary (cont.) 


GX_BASE+ 
Memory Offset Type | Name / Function Default Value 
8204h-8207h R/W GP_VECTOR_MODE 00000000h 
Graphics Pipeline Vector Mode Register — Writing to this register initiates the 
rendering of a vector. 
8208h-820Bh R/W GP_BLT_MODE 00000000h 
Graphics Pipeline BLT Mode Register — Writing to this initiates a BLT opera- 
tion. 
820Ch-820Fh R/W GP_BLT_STATUS 00000000h 
Graphics Pipeline BLT Status Register — Contains configuration and status 
information for the BLT engine. The status bits are contained in the lower byte 
of the register. 
8210h-8213h R/W GP_VGA_BASE XXXXXXXXN 
(Note) Graphics Pipeline VGA Memory Base Address Register — Specifies the offset 
of the VGA memory, starting from the base of graphics memory. 
8214h-8217h R/W GP_VGA_LATCH XXXXXXXXN 
(Note) Graphics Pipeline VGA Display Latch Register — Provides a memory mapped 
way to read or write the VGA display latch. 
Note: The registers at GX_BASE+8140, 8144h, 8210h, and 8217h are located in the area designated for the graphics pipeline but 
are used for VGA emulation purposes. Refer to Table 5-5 on page 200 for these register’s bit formats. 
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Table 4-25 Graphics Pipeline Configuration Registers 


Bit Name Description 


GX_BASE+8100h-8103h GP_DST/START_X/YCOOR Register (R/W) Default Value = 00000000h 


31:16 | DESTINATION/STARTING Y POSITION (SIGNED): 

BLT Mode — Specifies the destination Y position for a BLT operation. 
Vector Mode — Specifies the starting Y position in a vector. 

15:0 | DESTINATION/STARTING X POSITION (SIGNED): 

BLT Mode — Specifies the destination X position for a BLT operation. 
Vector Mode — Specifies the starting X position in a vector. 


GX_BASE+8104h-8107h GP_WIDTH/HEIGHT and Default Value = 00000000h 
GP_VECTOR_LENGTH/INIT_ERROR Register (R/W) 

31:16 | PIXEL_WIDTH or VECTOR_LENGTH (UNSIGNED): 

BLT Mode — Specifies the width, in pixels, of a BLT operation. No pixels are rendered for a width of zero. 

Vector Mode — Bits [31:30] are reserved in this mode allowing this 14-bit field to specify the length, in pixels, of a vector. 
No pixels are rendered for a length of zero. This field is limited to 14 bits due to a lack of precision in the registers used to 
hold the error terms. 

15:0 | PIXEL_HEIGHT or VECTOR_INITIAL_ERROR (UNSIGNED): 

BLT Mode — Specifies the height, in pixels, of a BLT operation. No pixels are rendered for a height of zero. 

Vector Mode — Specifies the initial error for renderng a vector. 


GX_BASE+8108h-810Bh GP_SCR_X/YCOOR and GP_AXIAL/DIAG_ERROR Register (R/W) Default Value = 00000000h 


31:16 | SRC_X_POS or VECTOR_AXIAL_ERROR (SIGNED): 

BLT Mode — Specifies the source X position for a BLT operation. 

Vector Mode — Specifies the axial error for rendering a vector. 

15:0 | SRC_Y_POS or VECTOR_DIAG_ERROR (SIGNED): 

Source Y Position (Signed) — Specifies the source Y position for a BLT operation. 
Vector Mode — Specifies the diagonal error for rendering a vector. 


GX_BASE+810Ch-810Fh GP_SRC_COLOR Register (R/W) Default Value = 00000000h 


8-BPP Mode 


31:24 | GP_SRC_COLOR_0: 
23:16 | 8-BPP Color Index — The color index must be duplicated in the upper byte of GR_SRC_COLOR_0 when rendering 8- 
BPP data. 


15:8 | GP_SRC_COLOR_1: 

7:0 8-BPP Color Index — The color index must be duplicated in the upper byte of GP_SRC_COLOR_1 when rendering 8- 
BPP data. 

16-BPP Mode 


31:16 | GP_SRC_COLOR_0: 16-BPP Color (RGB) 
15:0 | GP_SRC_COLOR_1: 16-BPP Color (RGB) 

Note: The Graphics Pipeline Source Color Register specifies the colors used when expanding monochrome source data in either 
the 8-BPP mode or the 16-BPP mode. Those pixels corresponding to clear bits (0) in the source data are rendered using 
GP_SRC_COLOR_0 and those pixels corresponding to set bits (1) in the source data are rendered using 
GP_SRC_COLOR_1. 
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Table 4-25 Graphics Pipeline Configuration Registers (cont.) 


Bit Name Description 
GX_BASE+8110h-8113h GP_PAT_COLOR_A Register (R/W) Default Value = 00000000h 
8-BPP Mode 


31:24 | GP_PAT_COLOR_O: 


23:16 | 8-BPP Color Index — The color index must be duplicated in the upper byte of GP_PAT_COLOR_0 when rendering 8- 
BPP data. 


15:8 | GP_PAT_COLOR_1: 


7:0 8-BPP Color Index — The color index must be duplicated in the upper byte of GP_PAT_COLOR_1 when rendering 8- 
BPP data. 


16-BPP Mode 


31:16 | GP_PAT_COLOR_0: 16-BPP Color (RGB) 
15:0 | GP_PAT_COLOR_1: 16-BPP Color (RGB) 
Note: The Graphics Pipeline Pattern Color A and B Registers specify the colors used when expanding pattern data. 


GX_BASE+8114h-8117h GP_PAT_COLOR_B Register (R/W) Default Value = 00000000h 


8-BPP Mode 


31:24 | GP_PAT_COLOR_2: 


23:16 | 8-BPP Color Index — The color index must be duplicated in the upper byte of GP_PAT_COLOR_2 when rendering 8- 
BPP data. 


15:8 | GP_PAT_COLOR_3: 


7:0 8-BPP Color Index — The color index must be duplicated in the upper byte of GP_PAT_COLOR_3 when rendering 8- 
BPP data. 


16-BPP Mode 


31:16 | GP_PAT_COLOR_2: 16-BPP Color (RGB) 
15:0 | GP_PAT_COLOR_3: 16-BPP Color (RGB) 
Note: The Graphics Pipeline Pattern Color A and B Registers specify the colors used when expanding pattern data. 


GX_BASE+8120h-8123h GP_PAT_DATA_0 Register (R/W) Default Value = 00000000h 


31:0 | GP Pattern Data Register 0: The Graphics Pipeline Pattern Data Registers 0 through 3 together contain 128 bits of pat- 
tern data. The GP_PAT_DATA_0 register corresponds to bits [31:0] of the pattern data. 


GX_BASE+8124h-8127h GP_PAT_DATA_1 Register (R/W) Default Value = 00000000h 


31:0 | GP Pattern Data Register 1: The Graphics Pipeline Pattern Data Registers 0 through 3 together contain 128 bits of pat- 
tern data. The GP_PAT_DATA_1 register corresponds to bits [63:32] of the pattern data. 


GX_BASE+8128h-812Bh GP_PAT_DATA_2 Register (R/W) Default Value = 00000000h 


31:0 | GP Pattern Data Register 2: The Graphics Pipeline Pattern Data Registers 0 through 3 together contain 128 bits of pat- 
tern data. The GP_PAT_DATA_2 register corresponds to bits [95:64] of the pattern data. 


GX_BASE+812Ch-812Fh GP_PAT_DATA_3 Register (R/W) Default Value = 00000000h 


31:0 | GP Pattern Data Register 3: The Graphics Pipeline Pattern Data Registers 0 through 3 together contain 128 bits of pat- 
tern data. The GP_PAT_DATA_3 register corresponds to bits [127:96] of the pattern data. 
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Table 4-25 Graphics Pipeline Configuration Registers (cont.) 


Bit Name Description 


GX_BASE+8140h-8143h GP_VGA_WRITE Register (R/W) Default Value = xxxxxxxxh 


Note that the registers at GX_BASE+82140h and 8144h are located in the area designated for the graphics pipeline but are used 
for VGA emulation purposes. Refer to Table 5-5 on page 200 for these register’s bit formats. 


GX_BASE+8144h-8147h GP_VGA_READ Register (R/W) Default Value = 00000000h 


Note that the registers at GX_BASE+82140h and 8144h are located in the area designated for the graphics pipeline but are used 
for VGA emulation purposes. Refer to Table 5-5 on page 200 for these register’s bit formats. 


GX_BASE+8200h-8203h GP_RASTER_MODE Register (R/W) Default Value = 00000000h 
31:13 RSVD Reserved: Set to 0. 
12 TB Transparent BLIT: When set, this bit enables transparent BLIT. The source color data will be compared to 


a color key and if it matches, that pixel will not be drawn. The color key value is stored in the BLIT buffer as 
destination data. The raster operation must be set to C6h, and the pattern registers must be all F’s for this 
mode to work properly. 


11 ST Source Transparency: Enables transparency for monochrome source data. Those pixels corresponding 
to clear bits in the source data are not drawn. 

10 PT Pattern Transparency: Enables transparency for monochrome pattern data. Those pixels corresponding 
to clear bits in the pattern data are not drawn. 

9:8 PM Pattern Mode: Specifies the format of the pattern data. 


00 = Indicates a solid pattern. The pattern data is always sourced from the GP_PAT_COLOR_0 register. 


01 = Indicates a monochrome pattern. The pattern data is sourced from the GP_PAT_COLOR_0 and 
GP_PAT_COLOR_1 registers. 


10 = Indicates a dither pattern. All four pattern color registers are used. 
11 =Indicates a color pattern. The pattern data is sourced directly from the pattern data registers. 


7:0 ROP Raster Operation: Specifies the raster operation for pattern, source, and destination data. 
GX_BASE+8204h-8207h GP_VECTOR_MODE Register (R/W) Default Value = 00000000h 
31:4 RSVD Reserved: Set to 0. 
3 DEST Read Destination Data: Indicates that frame-buffer destination data is required. 
2 DMIN Minor Direction: Indicates a positive minor axis step. 
1 DMAJ Major Direction: Indicates a positive major axis step. 
0 YMAJ Major Direction: Indicates a Y Major vector. 
GX_BASE+8208h-820Bh GP_BLT_MODE Register (R/W) Default Value = 00000000h 
31:9 RSVD Reserved: Set to 0. 
8 Y Reverse Y Direction: Indicates a negative increment for the Y position. This bit is used to control the 
direction of screen to screen BLTs to prevent data corruption in overlapping windows. 
7:36 SM Source Mode: Specifies the format of the source data. 


00 = Source is a color bitmap. 
01 = Source is a monochrome bitmap (use source color expansion). 
10 = Unused. 


11 = Source is a text glyph (use source color expansion). This differs from a monochrome bitmap in that 
the X position is adjusted by the width of the BLT and the Y position remains the same. 
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Table 4-25 Graphics Pipeline Configuration Registers (cont.) 


Bit 


Name 


Description 


5 


RSVD 


Reserved: Set to 0. 


4:2 


RD 


Destination Data: Specifies the destination data location. 

000 = No destination data is required. The destination data into the raster operation unit is all ones. 
010 = Read destination data from BLT Buffer 0. 

011 = Read destination data from BLT Buffer 1. 

100 = Read destination data from the frame buffer (store temporarily in BLT Buffer 0). 

101 = Read destination data from the frame buffer (store temporarily in BLT Buffer 1). 


1:0 


RS 


Source Data: Specifies the source data location. 

00 = No source data is required. The source data into the raster operation unit is all ones. 
01 = Read source data from the frame buffer (temporarily stored in BLT Buffer 0). 

10 = Read source data from BLT Buffer 0. 

11 = Read source data from BLT Buffer 1. 


GX_BASE+820Ch-820Fh 


GP_BLT_STATUS Register (R/W) Default Value = 00000000h 


31:10 RSVD Reserved: Set to 0. 
9 WwW Screen Width: Selects a frame-buffer width of 2048 bytes (default is 1024 bytes). 
8 M 16-BPP Mode: Selects a pixel data format of 16 BPP (default is 8 BPP). 
7:3 RSVD Reserved: Set to 0. 
2 BP (RO) | BLT Pending (Read Only): Indicates that a BLT operation is pending in the master registers. 
The “BLT Pending” bit must be clear before loading any of the graphics pipeline registers. Loading regis- 
ters when this bit is set high will destroy the values for the pending BLT. 
1 PB (RO) _ | Pipeline Busy (Read Only): Indicates that the graphics pipeline is processing data. 
The “Pipeline Busy” bit differs from the “BLT Busy” bit in that the former only indicates that the graphics 
pipeline is processing data. The “BLT Busy” bit also indicates that the memory controller has not yet pro- 
cessed all of the requests for the current operation. 
The “Pipeline Busy” bit must be clear before loading a BLT buffer if the previous BLT operation used the 
same BLT buffer. 
0 BB (RO) _ | BLT Busy (Read Only): Indicates that a BLT / vector operation is in progress. 


The “BLT Busy” bit must be clear before accessing the frame buffer directly. 


GX_BASE+8210h-8213h 


Note that the registers at GX_BASE+8210h and 8214h are located in the area designated for the graphics pipeline but are used for 
VGA emulation purposes. Refer to Table 5-5 on page 200 for these register’s bit formats. 


GGP_VGA_BASE (R/W) Default Value = xxxxxxxxh 


GX_BASE+8214h-8217h 


Note that the registers at GX_BASE+8210h and 8214h are located in the area designated for the graphics pipeline but are used for 
VGA emulation purposes. Refer to Table 5-5 on page 200 for these register’s bit formats. 


GP_VGA_LATCH Register (R/W) Default Value = xxxxxxxxh 
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4.5 Display Controller sion/decompression (CODEC) hardware, hard- 
The MediaGX MMX-Enhanced processor incorpo- Ware cursor, a 256-entry-by-18-bit palette RAM 
rates a display controller that retrieves display data _ (Plus three extension colors), display timing gener- 


from the memory controller and formats it for ator, dither and frame-rate-modulation circuitry for 
output on a variety of display devices. The Tem, panels, and flexible output formatting logic. A 
MediaGX processor can directly connect to an diagram of the display controller subsystem is 
active matrix TFT LCD flat panel or to an external shown in Figure 4-14. 


RAMDAC for CRT display or both. The display 
controller includes a display FIFO, compres- 


Compressed 
Line Buffer 
(64x32 bit) 


Output 
Format 


Display 
FIFO 
(64x64 bit) 


Pseudo/True 
Color Mux 


Palette Data 


Memory 
Address 
Generator 


Control Registers 
and 
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Memory 
Address 


aot Output 
minnie Control 


Generator 


Figure 4-14 Display Controller Block Diagram 
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4.5.1 Display FIFO 


The display controller contains a large (64x64 bit) 
FIFO for queuing up display data from the memory 
controller as it is required for output to the screen. 
The memory controller must arbitrate between the 
display controller requests and other requests for 
memory access from the microprocessor core, L1 
cache controller, and the graphics pipeline. 


Since display data is required in real time, this data 
is the highest priority in the system. Without effi- 
cient memory management, system performance 
would suffer dramatically due to the constant 
display-refresh requests from the display controller. 
The large size of the display FIFO is desirable so 
that the FIFO may primarily be loaded during times 
when there is no other request pending to the 
DRAM controller and so that the memory controller 
can stay in page mode for a long period of time 
when servicing the display FIFO. When a priority 
request from the cache or graphics pipeline occurs, 
if the display FIFO has enough data queued up, 
the DRAM controller can immediately service the 
request without concern that the display FIFO will 
underflow. If the display FIFO is below a program- 
mable threshold, a high-priority request will be sent 
to the DRAM controller, which will take precedence 
over any other requests that are pending. 


The display FIFO is 64 bits wide to accommodate 
high-speed burst read operations from the DRAM 
controller at maximum memory bandwidth. In addi- 
tion to the normal pixel data stream, the display 
FIFO also queues up cursor patterns. 


Display Controller 


4.5.2 Compression Technology 


To reduce the system memory contention caused 
by the display refresh, the display controller 
contains compression and decompression logic for 
compressing the frame buffer image in real time as 
it is sent to the display. It combines this 
compressed display buffer into the extra off-screen 
memory within the graphics memory aperture. 
Coherency of the compressed display buffer is 
maintained by use of dirty and valid bits for each 
line. The dirty and valid RAM is contained on-chip 
for maximum efficiency. Whenever a line has been 
validly compressed, it will be retrieved from the 
compressed display buffer for all future accesses 
until the line becomes dirty again. Dirty lines will be 
retrieved from the normal uncompressed frame 
buffer. 


The compression logic has the ability to insert a 
programmable number of "static" frames, during 
which time dirty bits are ignored and the valid bits 
are read to determine whether a line should be 
retrieved from the frame buffer or compressed 
display buffer. The less frequently the dirty bits are 
sampled, the more frequently lines will be retrieved 
from the compressed display buffer. This allows a 
programmable screen image update rate (as 
opposed to refresh rate). Generally, an update rate 
of 30 frames per second is adequate for displaying 
most types of data, including real- time video. 
However, if a flat panel display is used that has a 
slow response time, such as 100ms, the image 
need not be updated faster than ten frames per 
second, since the panel could not display changes 
beyond that rate. 


Page 146 


Cyrix Corporation Confidential 


GXm_db_v2.0 


Display Controller A | 


The compression algorithm used in the MediaGX 
processor commonly achieves compression ratios 
between 10:1 and 20:1, depending on the nature of 
the display data. This high level of compression 
provides higher system performance by reducing 
typical latency for normal system memory access, 
higher graphics performance by increasing avail- 
able drawing bandwidth to the DRAM array, and 
much lower power consumption by significantly 
reducing the number of off-chip DRAM accesses 
required for refreshing the display. These advan- 
tages become even more pronounced as display 
resolution, color depth, and refresh rate are 
increased and as the size of the installed DRAM 
increases. 


As uncompressed lines are fed to the display, they 
will be compressed and stored in an on-chip 
compressed line buffer (64x32 bits). Lines will not 
be written back to the compressed display buffer in 
the DRAM unless a valid compression has 
resulted, so there is no penalty for pathological 
frame buffer images where the compression algo- 
rithm breaks down. 


4.5.3 Motion Video Acceleration 


Support 


The display controller of the MediaGX processor 
supports the Cx5520 hardware motion video accel- 
eration by reading the off-screen video buffer and 
serializing the video data onto the RAMDAC port. 
The display controller supplies video data to the 
Cx5520 in either interleaved YUV4:2:2 format or 
RGB5:6:5 format. The Cx5520 can then scale and 
filter the data, apply color space conversion to YUV 
data, and mix the video data with graphics data, 
also supplied by the display controller. 


4.5.4 Hardware Cursor 


The display controller contains hardware cursor 
logic to allow overlay of the cursor image onto the 
pixel data stream. Overhead for updating this 
image on the screen is kept to a minimum by 
requiring that only the X and Y position be 
changed. This eliminates "submarining" effects 
commonly associated with software cursors. The 
cursor, 32x32 pixels with two bits per pixel, is 
loaded into off-screen memory within the graphics 
memory aperture. The two-bit code selects color 0, 
color 1, transparent, or background-color inversion 
for each pixel in the cursor (see Table 4-31 on 
page 165). The two cursor colors will be stored as 
extensions to the normal 256-entry palette at loca- 
tions 100h and 101h. These palette extensions will 
be used when driving a flat panel or a RAMDAC 
operating in 16 BPP (bits per pixel) mode. For 8 
BPP operation using an external RAMDAC, the 
DC_CURSOR_COLOR register 
(GX_BASE+8360h) should be programmed to set 
the indices for the cursor colors. To avoid corrup- 
tion of the cursor colors by an application program 
that modifies the external palette, care should be 
taken to program the cursor color indices to one of 
the static color indices. Since Microsof® 
Windows® typically uses only black and white 
cursor colors and these are static colors, this kind 
of problem should rarely occur. 
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4.5.5 Display Timing Generator 


The display controller features a fully program- 
mable timing generator for generating all timing 
control signals for the display. The timing control 
signals include horizontal and vertical sync and 
blank signals in addition to timing for active and 
overscan regions of the display. The timing gener- 
ator is similar in function to the CRTC of the orig- 
inal VGA, although programming is more 
straightforward. Programming of the timing regis- 
ters will generally happen via a BIOS INT10 call 
during a mode set. When programming the timing 
registers directly, extreme care should be taken to 
ensure that all timing is compatible with the display 
device. 


The timing generator supports overscan to main- 
tain full backward compatibility with the VGA. This 
feature is supported primarily for CRT display 
devices since flat panel displays have fixed resolu- 
tions and do not provide for overscan. However, 
the MediaGX processor supports a mechanism to 
center the display when a display mode is selected 
having a lower resolution than the panel resolution. 
The border region is effectively stretched to fill the 
remainder of the screen. The border color is at 
palette extension 104h. For 8 BPP operation with 
an external RAMDAC, the DC_BORDER_COLOR 
register (GX_BASE+8368h) should also be 
programmed. 


4.5.6 Dither and Frame-Rate 


Modulation 


The display controller supports 2x2 dither and two- 
level frame-rate modulation (FRM) to increase the 
apparent number of colors displayed on 9-bit or 12- 
bit TFT panels. Dither and FRM are individually 
programmable. With dithering and FRM enabled, 
185,193 colors are possible on a 9-bit TFT panel, 
and 226,981 colors are possible on a 12-bit TFT 
panel. 


Display Controller 


4.5.7 Display Modes 


The MediaGX processor has two graphics output 
ports: one primarily designed for interfacing to 
Thin-Film-Transistor (TFT) flat-panel displays and 
the other primarily designed for interfacing to a 
RAMDAC that drives a CRT display. By having two 
separate ports, systems that contain both a TFT 
panel and a CRT port can be designed with a 
minimum of external devices. In addition, simulta- 
neous display configurations can be supported with 
optimum display quality on both display devices. 
The RAMDAC bus can be driven with 8 BPP 
indexed data to the palette in the RAMDAC while 
the TFT is driven with the appropriate true-color 
data that has already been frame-rate modulated 
and dithered if necessary. Display modes for the 
TFT port are supported and shown in Table 4-26. 
The PANEL data bus may also serve as a 
secondary RAMDAC output port for desktop 
systems that incorporate a 16-bit-pixel-port 
RAMDAC. The MediaGX processor supports 
multiple output data formats for interfacing to 
various TFT displays and RAMDACs in various 
display modes. The output formats supported are 
shown in Table 4-27 and Table 4-28. 


The MediaGX processor supports 640x480, 
800x600, and 1024x768 display resolutions at both 
8 and 16 bits per pixel. In addition, 1280x1024 
resolution is supported at 8 bits per pixel only. Two 
16-bit display formats are supported: RGB 5-6-5 
and RGB 5-5-5. Simultaneous display is supported 
for TFT panels and CRTs at 640x480 and 800x600 
resolution. All CRT modes use VESA-compatible 
timing. Table 4-29 gives the supported CRT display 
modes. 


The PANEL output port and RAMDAC output port 
can be individually configured to allow independent 
operation. It is possible to run the RAMDAC inter- 
face with 8 BPP indexed data while driving true- 
color data to the panel. It is also possible to run the 
RAMDAC interface in a clock-doubled fashion 
while operating the PANEL data bus in a single- 
clocked fashion. 
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The MediaGX processor supports both 8- and 16- port. RAMDACS with 8-bit pixel ports will be able to 


bit RAMDAC configurations, and a direct connec- support 16 BPP displays only up to 800x600 reso- 
tion to a TFT. For systems that utilize a direct lution. For configurations that utilize a 16-bit 
connection to a TFT display and RAMDAC, only RAMDAC with no TFT attached, resolutions up to 
eight bits of data are provided to the RAMDAC 1024x768 can be supported at 16 BPP. 


Table 4-26 TFT Panel Display Modes 


Refresh | DOTCLK 
Simultaneous Rate Rate PCLK Panel Maximum Displayed 
Resolution Colors (Hz) (MHz) (MHz) Type Colors (Note 1) 
640x480 8 BPP 60 25.175 25.175 | 9-bit 579 = 185,193 
(Note 2) 256 colors out of a al 3 
palette of 256 12-bit 61° = 226,981 
18-bit 43 = 262,144 
16 BPP 60 25.175 25.175 | 9-bit 29x57x29 = 47,937 
or 12-bit 31x61x31 = 58,621 
18-bit 32x64x32 = 65,535 
800x600 8 BPP 60 40.0 40.0 9-bit 579 = 185,193 
(Note 2) 256 colors out of a a 3 
palette of 256 12-bit 61° = 226,981 
18-bit 643 = 262,144 
16 BPP 60 40.0 40.0 9-bit 29x57x29 = 47,937 
Soe 12-bit 31x61x31 = 58,621 
18-bit 32x64x32 = 65,535 
1024x768 8 BPP 60 65 32.5 9-bit/18-/F | 57° = 185,193 
256 colors out of a 
palette of 256 
16 BPP 60 65 32.5 9-bit/18-I/F | 29x57x29 = 47,937 
64K colors 
5-6-5 


Notes: 1) 9-bit and 12-bit panels use FRM and dither to increase displayed colors. (See Section 4.5.6 “Dither and 
Frame-Rate Modulation” on page 148.) 


2) All 640x480 and 800x600 modes can be run in simultaneous display with CRT. 
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Table 4-27 TFT Panel Data Bus Formats 


Panel 9-Bit TFT 16-Bit RAMDAC 
Data Bus 18-Bit 12-Bit 16 BPP, 1280x1024x8 BPP, 

Bit TFT TFT 640x480 1024x768 Upper Half of Pixel 2nd Pixel 
17 R5 R5 R5 R5 Even R5 P7 
16 R4 R4 R4 R4 R4 P6 
15 R3 R3 R3 R3 R3 P5 
14 R2 R2 R5 Odd R2 P4 
13 R1 R4 R1 P3 
12 RO R3 

11 G5 G5 G5 G5 Even G5 P2 
10 G4 G4 G4 G4 G4 P1 
9 G3 G3 G3 G3 G3 PO 
8 G2 G2 G5 Odd 

7 G1 G4 

6 GO G3 

5 B5 B5 B5 B5 Even 

4 B4 B4 B4 B4 

3 B3 B3 B3 B3 

2 B2 B2 B5 Odd 

1 B1 B4 

0 BO B3 


Table 4-28 CRT RAMDAC Data Bus Formats 


. ‘ 8-Bit RAMDAC 16-Bit RAMDAC 
8- or 16-Bit 16-Bit 16 BPP 16 BPP 
RAMDAC RAMDAC, RAMDAC, 

Data 8 BPP 1280x1024x8 BPP, First Second Lower Half 

Bus Indexed Output First Pixel Transfer Transfer Of Pixel Video” 
7 P7 P7 G2 R5 G2 V7 
6 P6 P6 G1 R4 G1 V6 
5 P5 P5 GO R3 GO V5 
4 P4 P4 B5 R2 B5 V4 
3 P3 P3 B4 R1 B4 V3 
2 P2 P2 B3 G5 B3 V2 
1 P1 P1 B2 G4 B2 v1 
0 PO PO Bi G3 Bi Vo 


Note: “*Refer to the Cx5520 or Cx5530 Data Book for details on YUV ordering. 
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Table 4-29 CRT Display Modes 


Refresh Rate | DOTCLK Rate PCLK Graphics Port 
Resolution Colors (Hz) (MHz) (MHz) Width (Bits) 
640x480 8 BPP 60 25.175 25.175 8 
256 colors out of a 
palette of 256 72 31.5 31.5 8 
75 31.5 31.5 8 
16 BPP 60 25.175 50.35 8 
64 K colors 
RGB 5-6-5 25.175 16 
72 31.5 63.0 8 
31.5 16 
75 31.5 63.0 8 
31.5 16 
800x600 8 BPP 60 40.0 40.0 8 
256 colors out of a 
palette of 256 te au ee i 
75 49.5 49.5 8 
16 BPP 60 40.0 80.0 8 
64 K colors 
RGB 5-6-5 40.0 16 
72 50.0 100 8 
50.0 16 
75 49.5 99 8 
49.5 16 
1024x768 8 BPP 60 65.0 65.0 8 
256 colors out of a 
palette of 256 70 75.0 75.0 8 
75 78.5 78.5 8 
16 BPP 60 65.0 65.0 16 
ef eos 70 75.0 75.0 16 
RGB 5-6-5 
75 78.5 78.5 16 
1280x1024 8 BPP 60 108.0 108.0 8 
256 colors out of a 
4. 1 
palette of 256 ong $ 
75 135.0 67.5 16 
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4.5.8 Graphics Memory Map 


The MediaGX processor supports a maximum of 
4MB of graphics memory and will map it to an 
address space (see Figure 4-2 on page 105) 
higher than the maximum amount of installed 
RAM. The graphics memory aperture physically 
resides at the top of the installed system RAM. The 
start address and size of the graphics memory 
aperture are programmable on 128KB boundaries. 
Typically, the system BIOS sets the size and start 
address of the graphics memory aperture during 
the boot process based on the amount of installed 
RAM, user defined CMOS settings, and display 
resolution. The graphics pipeline and display 
controller address the graphics memory with a 20- 
bit offset (address bits [21:2]) and four byte 
enables into the graphics memory aperture. The 
graphics memory stores several buffers that are 
used to generate the display: the frame buffer, 
compressed display buffer, VGA memory, and 
cursor pattern(s). Any remaining off-screen 
memory within the graphics aperture may be used 
by the display driver as desired or not at all. 


Display Controller 


4.5.8.1 DC Memory Organization 
Registers 


The display controller contains a number of regis- 
ters that allow full programmability of the graphics 
memory organization. This includes starting offsets 
for each of the buffer regions described above, line 
delta parameters for the frame buffer and compres- 
sion buffer, as well as compressed line-buffer size 
information. The starting offsets for the various 
buffers are programmable for a high degree of flex- 
ibility in memory organization. 


4.5.8.2 Frame Buffer and 
Compression Buffer 
Organization 

The MediaGX processor supports primary display 

modes 640x480, 800x600, and 1024x768 at both 8 

BPP and 16 BPP, and 1280x1024 at 8 BPP. Pixels 

will be packed into DWORDs as shown in Figure 4- 

15. 


(0, 0) (1023,0) 


DWORD 0 Sear 


(0, 1023) (1023, 1023) 
Bit Position 31|30|29|28/27|26/25|24/23/22)21 |20/1 9/18/14 7|/16/15|14/13/12)11/10/9 |8/7/6)/5/4)3/2)/1)0 
Address 3h 2h th Oh 
Pixel Org - 8 BPP (3,0) (2,0) (1,0) (0,0) 
Pixel Org - 16 BPP (1,0) (0,0) 
Figure 4-15 Pixel Arrangement Within a DWORD 
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In order to simplify address calculations by the 
rendering hardware, the frame buffer is organized 
in an XY fashion where the offset is simply a 
concatenation of the X and Y pixel addresses. All 8 
BPP display modes with the exception of 
1280x1024 resolution will use a 1024-byte line 
delta between the starting offsets of adjacent lines. 
All 16 BPP display modes and 1280x1024x8 BPP 
display modes will use a 2048-byte line delta 
between the starting offsets of adjacent lines. If 
there is room, the space between the end of a line 
and the start of the next line will be filled with the 
compressed display data for that line, thus allowing 
efficient memory utilization. For 1024x768 display 
modes, the frame-buffer line size is the same as 
the line delta, so no room is left for the compressed 
display data between lines. In this case, the 
compressed display buffer begins at the end of the 
frame buffer region and is linearly mapped. 


4.5.8.3. VGA Display Support 


The graphics pipeline contains full hardware 
support for the VGA front end. The VGA data is 
stored in a 256KB buffer located in graphics 
memory. The main task for SoftVGA is converting 
the data in the VGA buffer to an 8 BPP frame buffer 
that can be displayed by the MediaGX processor’s 
hardware. 


For some modes, the display controller can display 
the VGA data directly and the data conversion is 
not necessary. This includes standard VGA mode 
13h and the variations of that mode used in several 
games; the display controller can also directly 
display VGA planar graphics modes D, E, F, 10, 11, 
and 12. Likewise, the hardware can directly display 
all of the higher-resolution VESA modes. Since the 
frame buffer data is written directly to memory 
instead of travelling across an external bus, the 
MediaGX processor outperforms typical VGA 
cards for these modes. 


The display controller, however, does not directly 
support text modes. SoftVGA must then convert 
the characters and attributes in the VGA buffer to 
an 8 BPP frame buffer the hardware uses for 
display refresh. See Section 4 “Virtual Subsystem 
Architecture” for SoftVGA details. 


4.5.8.4 Cursor Pattern Memory 
Organization 


The cursor overlay patterns are loaded to indepen- 
dent memory locations, usually mapped above the 
frame buffer and compressed display buffer (off- 
screen). The cursor buffer must start on a 16-byte 
aligned boundary. It is linearly mapped, and is 
always 256 bytes in size. If there is enough room 
(256 bytes) after the compression-buffer line but 
before the next frame-buffer line starts, the cursor 
pattern may be loaded into this area to make effi- 
cient use of the graphics memory. 


Each pattern is a 32x32-pixel array of 2-bit codes. 
The codes are a combination of AND mask and 
XOR mask for a particular pixel. Each line of an 
overlay pattern is stored as two DWORDs, with 
each DWORD containing the AND masks for 16 
pixels in the upper word and the XOR masks for 16 
pixels in the lower word. DWORDs are arranged 
with the leftmost pixel block being least significant 
and the rightmost pixel block being most signifi- 
cant. Pixels within words are arranged with the left- 
most pixels being most significant and the 
rightmost pixels being least significant. 


Multiple cursor patterns may be loaded into the off- 
screen memory. An application may simply change 
the cursor start offset to select a new cursor 
pattern. The new cursor pattern will be used at the 
start of the next frame scan. 
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4.5.9 Display Controller Registers + Memory Organization Registers 

The Display Controller maps 100h locations * Timing Registers 

starting at GX_BASE+8300h. Refer to Section * Cursor and Line Compare Registers 

4.1.2 “Control Registers” on page 106 for instruc- * Color Registers 

tions on accessing these registers. « Palette and RAM Diagnostic Registers 

The Display Controller Registers are divided into Table 4-30 summarizes these registers and loca- 
six categories: tions and the following subsections give detailed 
* Configuration and Status Registers register/bit formats. 


Table 4-30 Display Controller Register Summary 


GX_BASE+ Default 
Memory Offset | Type | Name/Function Value 


Configuration and Status Registers 


8300h-8303h R/W_ | DC_UNLOCK 00000000h 
Display Controller Unlock — This register is provided to lock the most critical memory- 
mapped display controller registers to prevent unwanted modification (write operations). 
Read operations are always allowed. 


8304h-8307h R/W_ | DC_GENERAL_CFG 00000000h 
Display Controller General Configuration — General control bits for the display controller. 
8308h-830Bh R/W_ | DC_TIMING_CFG xx000000h 


Display Controller Timing Configuration — Status and control bits for various display 
timing functions. 
830Ch-830Fh R/W | DC_OUTPUT_CFG xx000000h 


Display Controller Output Configuration — Status and control bits for pixel output 
formatting functions. 


Memory Organization Registers 


8310h-8313h R/W | DC_FB_ST_OFFSET XXXXXXXXN 
Display Controller Frame Buffer Start Address — Specifies offset at which the frame 
buffer starts. 

8314h-8317h R/W_ | DC_CB_ST_OFFSET XXXXXXXxXN 
Display Controller Compression Buffer Start Address — Specifies offset at which the 
compressed display buffer starts. 

8318h-831Bh R/W DC_CURS_ST_OFFSET XXXXXXXXN 


Display Controller Cursor Buffer Start Address — Specifies offset at which the cursor 
memory buffer starts. 


831Ch-831Fh -- Reserved 00000000h 

8320h-8323h R/W_ | DC_VID_ST_OFFSET XXXXXXXXN 
Display Controller Video Start Address — Specifies offset at which the video buffer starts. 

8324h-8327h R/W | DC_LINE_DELTA XXXXXXXXN 
Display Controller Line Delta — Stores line delta for the graphics display buffers. 

8328h-832Bh R/W_ | DC_BUF_SIZE XXXXXXXXN 


Display Controller Buffer Size — Specifies the number of bytes to transfer for a line of 
frame buffer data and the size of the compressed line buffer. 


832Ch-832Fh os Reserved 00000000h 
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Table 4-30 Display Controller Register Summary (cont.) 


GX_BASE+ Default 
Memory Offset | Type | Name/Function Value 


Timing Registers 


8330h-8333h R/W DC_H_TIMING_1 XXXXXXXXN 
Display Controller Horizontal and Total Timing — Horizontal active and total timing 
information. 

8334h-8337h R/W_ | DC_H_TIMING_2 XXXXXXXXN 
Display Controller CRT Horizontal Blanking Timing — CRT horizontal blank timing 
information. 

8338h-833Bh R/W_ | DC_H_TIMING_3 XXXXXXXxXN 
Display Controller CRT Sync Timing — CRT horizontal sync timing information. 
833Ch-833Fh R/W DC_FP_H_TIMING XXXXXXXXN 
Display Controller Flat Panel Horizontal Sync Timing: Horizontal sync timing information 
for an attached flat panel display. 

8340h-8343h R/W_ | DC_V_TIMING_1 XXXXXXXXN 
Display Controller Vertical and Total Timing — Vertical active and total timing information. 
The parameters pertain to both CRT and flat panel display. 

8344h-8247h R/W_ | DC_V_TIMING_2 XXXXXXXXN 
Display Controller CRT Vertical Blank Timing — Vertical blank timing information. 
8348h-834Bh R/W_ | DC_V_TIMING_3 XXXXXXXXN 
Display Controller CRT Vertical Sync Timing — CRT vertical sync timing information. 
834Ch-834Fh R/W_ | DC_FP_V_TIMING XXXXXXXXN 


Display Controller Flat Panel Vertical Sync Timing — Flat panel vertical sync timing 
information. 


Cursor and Line Compare Registers 


8350h-8353h R/W | DC_CURSOR_X XXXXXXXXN 
Display Controller Cursor X Position — X position information of the hardware cursor. 
8354h-8357h RO DC_V_LINE_CNT XXXXXXXXN 


Display Controller Vertical Line Count — This read only register provides the current 
scanline for the display. It is used by software to time update of the frame buffer to avoid 
tearing artifacts. 


8358h-835Bh R/W DC_CURSOR_Y XXXXXXXXN 
Display Controller Cursor Y Position — Y position information of the hardware cursor. 
835Ch-835Fh R/W |DC_SS LINE_CMP XXXXXXXXN 


Display Controller Split-Screen Line Compare — Contains the line count at which the 
lower screen begins in a VGA split-screen mode. 


Color Registers 


8360h-8363h R/W_ | DC_CURSOR_COLOR XXXXXXXxXN 
Display Controller Cursor Color — Contains the 8-bit indices for the cursor colors. 

8364h-8367h -- Reserved 00000000h 

8368h-836Bh R/W DC_BORDER_COLOR XXXXXXXXN 
Display Controller Border Color — Contains the 8-bit index for the border or overscan 
color. 

836Ch-836Fh -- Reserved 00000000h 
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Table 4-30 Display Controller Register Summary (cont.) 


GX_BASE+ Default 
Memory Offset | Type | Name/Function Value 


Palette and RAM Diagnostic Registers 


8370h-8373h R/W_ | DC_PAL_ADDRESS XXXXXXXXN 


Display Controller Palette Address — This register should be written with the address 
(index) location to be used for the next access to the DC_PAL_DATA register. 


8374h-8377h R/W DC_PAL_DATA XXXXXXXXA 
Display Controller Palette Data — Contains the data for a palette access cycle. 
8378h-837Bh R/W_ | DC_DFIFO_DIAG XXXXXXXXN 


Display Controller Display FIFO Diagnostic — This register is provided to enable testabil- 
ity of the Display FIFO RAM. 


837Ch-837Fh R/W_ | DC_CFIFO_DIAG XXXXXXXXN 


Display Controller Compression FIFO Diagnostic — This register is provided to enable 
testability of the Compressed Line Buffer (FIFO) RAM. 
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4.5.9.1 Configuration and Status * Display Controller General Configuration 
Registers (DC_GENERAL_CFG) 

The Configuration and Status Registers group - General control bits for the display controller. 

consists of four 32-bit registers located at ¢ Display Controller Timing Configuration 

GX_BASE+8300h-830Ch. These registers are (DC_TIMING_CFG) 

described below and Table 4-31 gives their bit 


- Status and control bits for various display 


formats. timing functions. 


* Display Controller Unlock (DC_UNLOCK) * Display Controller Output Configuration 
- This register is provided to lock the most crit- (DC_OUTPUT_CFG) 
ical memory-mapped display controller regis- 
ters to prevent unwanted modification (write 
operations). Read operations are always 
allowed. 


- Status and control bits for pixel output format- 
ting functions. 


Table 4-31 Display Controller Configuration and Status Registers 


Bit Name Description 
GX_BASE+8300h-8303h DC_UNLOCK Register (R/W) Default Value = 00000000h 
31:16 RSVD Reserved: Set to 0. 
15:0 UNLOCK__ | Unlock Code: This register must be written with the value 4758h in order to write to the protected regis- 
CODE ters. The following registers are protected by the locking mechanism. 
DC_GENERAL_CFG DC_CB_ST_OFFSET, 
DC_BUF_SIZE, DC_V_TIMING_2 
DC_TIMING_CFG, DC_CURS_ST_OFFSET, 
DC_H_TIMING_1, DC_V_TIMING_3 
DC_OUTPUT_CFG, DC_H_TIMNG_2, 
DC_FP_H_TIMING DC_FB_ST_OFFSET, 
DC_LINE_DELTA, DC_FP_V_TIMING 
GX_BASE+8304h-8307h DC_GENERAL_CFG (R/W) Default Value = 00000000h 

31 DDCK Divide Dot Clock: Divide internal DOTCLK by two relative to PCLK (pertains only to 16 BPP display 
modes utilizing an eight-bit RAMDAC): 0 = Disable; 1 = Enable. 

30 DPCK Divide Pixel Clock: Divide PCLK by two relative to internal DOTCLK (pertains only to display modes 
that pack two pixels together such as 1280x1024 on an external CRT only): 0 = Disable; 1 = Enable. 

29 VRDY Video Ready Protocol: 0 = Low speed video port, use with V2.3 and older. 
1 = High speed video port, use with V2.4 and newer. 

28 VIDE Video Enable: Motion video port: 0 = Disable; 1 = Enable. 

27 SSLC Split-screen Line Compare: VGA line compare function: 0 = Disable; 1 = Enable. 
When enabled, the internal line counter will be compared to the value programmed in the DC_SS 
_LINE_CMP register. If it matches, the frame buffer address will be reset to zero. This enables a split 
screen function. 

26 CH4S Chain 4 Skip: Allow display controller to read every 4th DWORD from the frame buffer for compatibility 
with the VGA: 0 = Disable; 1 = Enable. 

25 DIAG FIFO Diagnostic Mode: This bit allows testability of the on-chip Display FIFO and Compressed Line 
Buffer via the diagnostic access registers. A low-to-high transition will reset the Display FIFO’s R/W 
pointers and the Compressed Line Buffer’s read pointer. 0 = Normal operation; 1 = Enable. 
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Table 4-31 Display Controller Configuration and Status Registers (cont.) 


Bit Name Description 


24 LDBL Line Double: Allow line doubling for emulated VGA modes: 0 = Disable; 1 = Enable. 

If enabled, this will cause each odd line to be replicated from the previous line as the data is sent to the 
display. Timing parameters should be programmed as if no pixel doubling is used, however, the frame 
buffer should be loaded with half the normal number of lines. 

23 CKWR Clock Write: This bit will be output directly to an external clock chip or SYNDAC. The bit should be 
pulsed high and low by the software to strobe data into the chip. 

Note that this bit can be used in conjunction with the DACRS[2:0] pins. 

22:20 | DAC_RS[2:0] | RAMDAC Register Selects: This 3-bit field sets the register select inputs to the external RAMDAC for 
the next cycle. It is used to allow access to the extended register set of the RAMDAC. Alternatively, 
these bits may be used in selecting the frequency for an external clock chip or SYNDAC. If more than 
eight frequency selections are required, the RAMDAC extended register programming sequence must 
be used or the additional select bit must be provided by some other means. 


19 RTPM Real-Time Performance Monitoring: Allows real-time monitoring of a variety of internal MediaGX 
processor signals by multiplexing the signals onto the CLKWR and DACRS[2:0] pins: 

0 = Disable (Normal operation); 1 = Enable. 

The CLKWR pin should not be fed to a clock chip or SYNDAC when this mode of operation is used, a 
different programming scheme should be used for the clock chip using the DACRS[2:0] signals and 
RAMDACRD# and RAMDACWR3# signals. The selection of output signals is made using bits [27:16] of 
the DC_BUF_SIZE register. The lower 12 bits of this field will select one of eight outputs for each pin. 


18 FDTY Frame Dirty Mode: Allow entire frame to be flagged as dirty whenever a pixel write occurs to the frame 
buffer (this is provided for modes that use a linearly mapped frame buffer for which the line delta is not 
equal to 1024 or 2048 bytes): 0 = Disable; 1 = Enable. 

When disabled, dirty bits are set according to the Y address of the pixel write. 

17 RSVD Reserved: Set to 0. 

16 CMPI Compressor Insert Mode: Insert one static frame between update frames: 0 = Disable; 1 = Enable. 
An update frame is referred to as a frame in which dirty lines will be allowed to be updated. Conversely, 
a static frame is referred to as a frame in which dirty lines will not be updated (although the image may 


not be static, since lines that are not compressed successfully must be retrieved from the uncompressed 
frame buffer). 


15:12 DFIFO Display FIFO High Priority End Level: This field specifies the depth of the display FIFO (in 64-bit 
HI-PRI END | entries x 4) at which a high-priority request previously issued to the memory controller will end. The 
LVL value is dependent upon display mode. 
This register should always be non-zero and should be larger than the start level. 
11:8 DFIFO Display FIFO High Priority Start Level: This field specifies the depth of the display FIFO (in 64-bit 
HI-PRI entries x 4) at which a high-priority request will be sent to the memory controller to fill up the FIFO. The 


START LVL_ | value is dependent upon display mode. 

This register should always be nonzero and should be less than the high-priority end level. 

7:6 DCLK_ DCLK Multiplier: This 2-bit field specifies the clock multiplier for the input DCLK pin. After the input 
MUL clock is optionally multiplied, the internal DOTCLK, PCLK, and FPCLK may be divided as necessary. 


00 = Forced Low 
01 =1xDCLK 
10 =2x DCLK 
11=4x DCLK 


5 DECE Decompression Enable: Allow operation of internal decompression hardware: 
0 = Disable; 1 = Enable. 


4 CMPE Compression Enable: Allow operation of internal compression hardware: 0 = Disable; 1 = Enable 
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Table 4-31 Display Controller Configuration and Status Registers (cont.) 


Bit Name Description 

3 PPC Pixel Panning Compatibility: This bit has the same function as that found in the VGA. 
Allow pixel alignment to change when crossing a split-screen boundary - it will force the pixel alignment 
to be 16-byte aligned: 0 = Disable; 1 = Enable. 
If disabled, the previous alignment will be preserved when crossing a split-screen boundary. 

2 DVCK Divide Video Clock: Selects frequency of VID_CLK pin: 
0 = VID_CLK pin frequency is equal to one-half (2) the frequency of the core clock. 
1 = VID_CLK pin frequency is equal to one-fourth (4) the frequency of the core clock. 
Note: Bit 28 (VIDE) must be set to 1 for this bit to be valid. 

1 CURE Cursor Enable: Allow operation of internal hardware cursor: 0 = Disable; 1 = Enable. 
0 DFLE Display FIFO Load Enable: Allow the display FIFO to be loaded from memory: 


0 = Disable; 1 = Enable. 
If disabled, no write or read operations will occur to the display FIFO. 


If enabled, a flat panel should be powered down prior to setting this bit low. Similarly, if active, a CRT 
should be blanked prior to setting this bit low. 


GX_BASE+8308h-830Bh 


DC_TIMING_CFG Register (R/W) Default Value = xxx00000h 


31 VINT Vertical Interrupt (Read Only): Is a vertical interrupt pending? 0 = No; 1 = Yes. 
(RO) This bit is provided to maintain backward compatibility with the VGA. It corresponds to VGA port 3C2h 
bit 7. 
30 VNA Vertical Not Active (Read Only): Is the active part of a vertical scan is in progress (i.e. retrace, blank- 
(RO) ing, or border)? 0 = Yes; 1 = No. 
This bit is provided to maintain backward compatibility with the VGA. It corresponds to VGA port 
3BA/3DA bit 3. 
29 DNA Display Not Active (Read Only): Is the active part of a line is being displayed (i.e. retrace, blanking, or 
(RO) border)? 0 = Yes; 1 = No. 
This bit is provided to maintain backward compatibility with the VGA. It corresponds to VGA port 
3BA/3DA bit 0. 
28 SENS Monitor Sense (Read Only): This bit returns the result of the voltage comparator test of the RGB lines 
(RO) from the external RAMDAC. The value will be a low level if one or more of the comparators exceed the 
340 mV level indicating an unloaded line. 
This bit can be tested repeatedly to determine the loading on the red, green, and blue lines by loading 
the palette with various values. The BIOS can then determine whether a color, monochrome, or no mon- 
itor is attached. If no RAMDAC is attached, the BIOS should assume that a color panel is attached and 
operate in color mode. For VGA emulation, read operations to port 3C2 bit 4 are redirected here. 
27 DDCI DDC Input (Read Only): This bit returns the value from the DDCIN pin that should reflect the value from 
(RO) pin 12 of the VGA connector. It is used to provide support for the VESA Display Data Channel standard 
level DDC1. 
26:20 RSVD Reserved: Set to 0. 
19:17 PWR_SEQ_ | Power Sequence Delay: This 3-bit field sets the delay between edges for the power sequencing control 
DELAY logic. The actual delay is this value multiplied by one frame period (typically 16ms). 


Note that a value of zero will result in a delay of only one DOTCLK period. 
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Table 4-31 Display Controller Configuration and Status Registers (cont.) 


Bit Name Description 

16 BKRT Blink Rate: 

0 = Cursor blinks on every 16 frames for a duration of 8 frames (approximately 4 times per second) and 
VGA text characters will blink on every 32 frames for a duration of 16 frames (approximately 2 times per 
second). 

1 = Cursor blinks on every 32 frames for a duration of 16 frames (approximately 2 times per second) and 
VGA text characters blink on every 64 frames for a duration of 32 frames (approximately 1 time per sec- 
ond). 

15 PXDB Pixel Double: Allow pixel doubling to stretch the displayed image in the horizontal dimension: 

0 = Disable; 1 = Enable. 

If bit 15 is enabled, timing parameters should be programmed as if no pixel doubling is used, however, 
the frame buffer should be loaded with half the normal pixels per line. Also, the FB_LINE_SIZE parame- 
ter in DC_BUF_SIZE should be set for the number of bytes to be transferred for the line rather than the 
number displayed. 

14 INTL Interlace Scan: Allow interlaced scan mode: 

0 = Disable (non-interlaced scanning is supported) 
1 = Enable (If a flat panel is attached, it should be powered down before setting this bit.) 

13 PLNR VGA Planar Mode: This bit must be set high for all VGA planar display modes. 

12 FCEN Flat Panel Center: Allows the border and active portions of a scan line to be qualified as “active” to a flat 
panel display via the ENADISP signal. This allows the use of a large border region for centering the flat 
panel display. 0 = Disable; 1 = Enable. 

When disabled, only the normal active portion of the scan line will be qualified as active. 

11 FVSP Flat Panel Vertical Sync Polarity: 

0 = Causes TFT vertical sync signal to be normally low, generating a high pulse during sync interval. 
1 = Causes TFT vertical sync signal to be normally high, generating a low pulse during sync interval. 

10 FHSP Flat Panel Horizontal Sync Polarity: 

0 = Causes TFT horizontal sync signal to be normally low, generating a high pulse during sync interval. 
1 = Causes TFT horizontal sync signal to be normally high, generating a low pulse during sync interval. 

9 CVSP CRT Vertical Sync Polarity: 

0 = Causes CRT VSYNC signal to be normally low, generating a high pulse during the retrace interval. 
1 = Cause CRT VSYNC signal to be normally high, generating a low pulse during the retrace interval. 

8 CHSP CRT Horizontal Sync Polarity: 

0 = Causes CRT HSYNC signal to be normally low, generating a high pulse during the retrace interval. 
1 = Causes CRT HSYNC signal to be normally high, generating a low pulse during the retrace interval. 

7 BLNK Blink Enable: Blink circuitry: 0 = Disable; 1 = Enable. 

If enabled, the hardware cursor will blink as well as any pixels. This is provided to maintain compatibility 
with VGA text modes. The blink rate is determined by the bit 16 (BKRT). 

6 VIEN Vertical Interrupt Enable: Generate a vertical interrupt on the occurrence of the next vertical sync 
pulse: 

0 = Disable, vertical interrupt is cleared; 
1 = Enable. 
This bit is provided to maintain backward compatibility with the VGA. 

5 TGEN Timing Generator Enable: Allow timing generator to generate the timing control signals for the display. 
0 = Disable, the Timing Registers may be reprogrammed, and all circuitry operating on the DOTCLK will 
be reset. 

1 = Enable, no write operations are permitted to the Timing Registers. 
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Table 4-31 Display Controller Configuration and Status Registers (cont.) 


Bit Name Description 


4 DDCK DDC Clock: This bit is used to provide the serial clock for reading the DDC data pin. This bit is multi- 
plexed onto the CRTVSYNC pin, but in order for it to have an effect, the VSYE bit must be set low to dis- 
able the normal vertical sync. Software should then pulse this bit high and low to clock data into the 
MediaGX processor. 

This feature is provided to allow support for the VESA Display Data Channel standard level DDC1. 

3 BLKE Blank Enable: Allow generation of the composite blank signal to the display device: 

0 = Disable; 1 = Enable. 

When disabled, the BLANK# output will be a static low level. This allows VESA DPMS compliance. 

2 VSYE Horizontal Sync Enable: Allow generation of the horizontal sync signal to a CRT display device: 

0 = Disable; 1 = Enable. 

When disabled, the HSYNC output will be a static low level. This allows VESA DPMS compliance. 
Note that this bit only applies to the CRT; the flat panel HSYNC is controlled by the automatic power 
sequencing logic. 

1 HSYE Vertical Sync Enable: Allow generation of the vertical sync signal to a CRT display device: 

0 = Disable; 1 = Enable. 

When disabled, the VSYNC output will be a static low level. This allows VESA DPMS compliance. 
Note that this bit only applies to the CRT; the flat panel VSYNC is controlled by the automatic power 
sequencing logic. 

0 FPPE Flat Panel Power Enable: On a low-to-high transition this bit will enable the flat panel power-up 


sequence to begin. This will first turn on VDD to the panel, then start the clocks, syncs, and pixel bus, 
then turn on the LCD bias voltage, and finally the backlight. 


On a high-to-low transition, this bit will disable the outputs in the reverse order. 


GX_BASE+830Ch-830Fh DC_OUTPUT_CFG Register (R/W) Default Value = xxx00000h 
31:16 RSVD Reserved: Set to 0. 
15 DIAG Compressed Line Buffer Diagnostic Mode: This bit will allow testability of the Compressed Line 


Buffer via the diagnostic access registers. A low-to-high transition will reset the Compressed Line Buffer 
write pointer. 0 = Disable (Normal operation); 1 = Enable. 


14 CFRW Compressed Line Buffer Read/Write Select: Enables the read/write address to the Compressed Line 
Buffer for use in diagnostic testing of the RAM. 

0 = Write address enabled 

1 = Read address enabled 

13 PDEH Panel Data Enable High: 

0 = The PANEL[17:9] data bus to be driven to a logic low level to effectively blank an attached flat panel 
display or disable the upper pixel data bus for 16-bit pixel port RAMDACs. 

1 = If no flat panel is attached, the PANEL[17:9] data bus will be driven with active pixel data. If a flat 
panel is attached, setting this bit high will have no effect — the upper panel bus will be driven based upon 
the power sequencing logic. 

12 PDEL Panel Data Enable Low: 

0 = This bit will cause the PANEL[8:0] data bus to be driven to a logic low level to effectively blank an 
attached flat panel display or disable the lower panel data bus if it is not required. 


1= If no flat panel is attached, the PANEL[8:0] data bus will be driven with active pixel data. If a flat panel 
is attached, setting this bit high will have no effect — the lower panel bus will be driven based upon the 
power sequencing logic. 
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Table 4-31 Display Controller Configuration and Status Registers (cont.) 


Bit Name Description 

11 PRMP Palette Re-map: 
0 = The modified codes are sent to the RAMDAC and the external palette should uses the modified 
mapping. 


1 = Bits [8:1] of the palette output register are routed to the RAMDAC data bus. The MediaGX processor 
internal palette RAM may be loaded with 8-bit VGA indices to translate the modified codes stored in dis- 
play memory so that the RAMDAC data bus will contain the expected indices. The modified codes are 
used to achieve character blinking in VGA text modes. This mode should be set high set high only for 
desktop systems with no flat panel attached. It should only be necessary when 8514/A or VESA stan- 
dard feature connector support is required. 

10 CKSL Clock Select: Selects output used to clock PANEL[17:0], FRHSYNC, FPVSYNC, and ENADISP output 
pins. 

1 =PCLK 

0 = FPCLK (based upon the power sequencing logic) 

This bit should be high when using a 16-bit RAMDAC. 

9 FRMS Frame Rate Modulation Select: 

0 = Enables FRM circuitry to change the pattern displayed every frame. 

1 = Enables FRM circuitry to change the pattern displayed every two frames (to allow for slower 
response time liquid crystal materials). 

8 3/4ADD 3- or 4-bit Add: 

0 = Enables dither and FRM circuitry to operate on the 3 most significant bits of each color component 
for 9-bit TFT panels. 

1 = Enables the dither and FRM circuitry to operate on the 4 most significant bits of each color compo- 
nent for 12-bit TFT panels. 

7 2IND 2 Index Enable: Allow two 8-bit pixel indices to be output each PCLK to a 16-bit wide external RAM- 
DAC. This mode is provided to support the 1280x1024x8 BPP display mode. In this mode, the PCLK fre- 
quency is one-half the screen DOTCLK frequency. 0 = Disable; 1 = Enable. 

6 2XCK 2 X Pixel Clock: Double the pixel clock on the 8-bit RAMDAC port so that a single 16-bit pixel can be 
output in two clocks: 

0 = Disable (single pixel will be output on each clock); 

1 = Enable. 

5 2PXE 2 Pixel Enable: If a TFT panel that supports two pixels per clock is attached and active, this bit will 
cause the output mux to combine two pixels and cause the FPCLK to be divided by two: 

0 = Disable (one pixel per clock will be output); 


1 = Enable. 

4 DITE Dither Enable: Allow a 2x2 spatial dither on the 3-bit or 4-bit color value. Note that dither will not be sup- 
ported for 12-bit TFT panels when FRM is enabled. 0 = Disable; 1 = Enable. 

3 FRME Frame-Rate Modulation Enable: Allow FRM to be performed on the 3-bit or 4-bit color value using the 


next most significant bit after the least significant bit sent to the panel. 
0 = Disable (no FRM performed); 

1 = Enable. 

2 PCKE PCLK Enable: 


0 = PCLK is disabled and a low logic level is driven off-chip. Also, the RAMDAC data bus is driven low. 
1 = Enable PCLK to be driven off-chip. 


This clock operates the RAMDAC interface. 
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Table 4-31 Display Controller Configuration and Status Registers (cont.) 


Bit Name Description 


1 16FMT 16 BPP Format: Selects RGB display mode: 

0 = RGB 5-6-5 mode 

1 = RGB 5-5-5 display mode 

This bit is only significant if 8 BPP is low, indicating 16 BPP mode. 
0 8BPP 8 BPP / 16 BPP Select: 


0 = 16-bit per pixel display mode is selected. (Bit 1 of OUTPUT_CONFIG will indicate the format of the 
16 bit data.) 


1 = 8-bit-per-pixel display mode is selected. This is the also the mode used in VGA emulation. 
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4.5.10 Memory Organization 
Registers 


The MediaGX processor utilizes a graphics 
memory aperture that is up to 4MB in size. The 
base address of the graphics memory aperture is 
stored in the DRAM controller. The graphics 
memory is made up of the normal uncompressed 
frame buffer, compressed display buffer, and 
cursor buffer. Each buffer begins at a program- 
mable offset within the graphics memory aperture. 


The various memory buffers are arranged so as to 
efficiently pack the data within the graphics 
memory aperture. This requires flexibility in the 
way that the buffers are arranged when different 
display modes are in use. The cursor buffer is a 
linear block so addressing is straightforward. The 
frame buffer and compressed display buffer are 
arranged based upon scan lines. Each scan line 
has a maximum number of valid or active 
DWORDs and a delta that, when added to the 
previous line offset, points to the next line. In this 
way, the buffers may be stored as linear blocks or 
as logical blocks as may be desired. 


The Memory Organization Registers group 
consists of six 32-bit registers located at 
GX_BASE+8310h-8328h. These registers are 
described below and Table 4-32 gives their bit 
formats. 


Display Controller Frame Buffer Start Address 

(DC_FB_ST_OFFSET) 

- Specifies the offset at which the frame buffer 
starts. 


Display Controller Compression Buffer Start 

Address (DC_CB_ST_OFFSET) 

- Specifies the offset at which the compressed 
display buffer starts. 


Display Controller Cursor Buffer Start Address 

(DC_CURS_ST_OFFSET) 

- Specifies the offset at which the cursor 
memory buffer starts. 


Display Controller Video Start Address 

(DC_VID_ST_OFFSET) 

- Specifies the offset at which the video buffer 
starts. 


Display Controller Line Delta (DC_LINE_DELTA) 
- Stores the line delta for the graphics display 
buffers. 


Display Controller Buffer Size (DC_BUF_SIZE) 
- Specifies the number of bytes to transfer for a 
line of frame buffer data and the size of the 
compressed line buffer. (The compressed line 

buffer will be invalidated if it exceeds the 
CB_LINE_SIZE, bits [15:9].) 
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Table 4-32 Display Controller Memory Organization Registers 


Bit 


Name 


Description 


GX_BASE+8310h-8313h 


DC_FB_ST_OFFSET Register (R/W) Default Value = xxxxxxxxh 


31:22 RSVD Reserved: Set to 0. 
21:0 FB_START | Frame Buffer Start Offset: This value represents the byte offset of the starting location of the dis- 
_OFFSET played frame buffer. This value may be changed to achieve panning across a virtual desktop or to 


allow multiple buffering. 

When this register is programmed to a nonzero value, the compression logic should be disabled. The 
memory address defined by bits [21:4] will take effect at the start of the next frame scan. The pixel off- 
set defined by bits [3:0] will take effect immediately (in general, it should only change during vertical 
blanking). 


GX_BASE+8314h-8317h 


DC_CB_ST_OFFSET Register (R/W) Default Value = xxxxxxxxh 


31:22 RSVD Reserved: Set to 0. 
21:0 CB_START | Compressed Display Buffer Start Offset: This value represents the byte offset of the starting loca- 
_OFFSET tion of the compressed display buffer. Bits [3:0] should always be programmed to zero so that the start 


offset is aligned to a 16-byte boundary. This value should change only when a new display mode is set 
due to achange in size of the frame buffer. 


GX_BASE+8318h-831Bh 


DC_CUR_ST_OFFSET Register (R/W) Default Value = xxxxxxxxh 


31:22 RSVD Reserved: Set to 0. 
21:0 CUR_START | Cursor Start Offset: This value represents the byte offset of the starting location of the cursor display 
_OFFSET pattern. Bits [1:0] should always be programmed to zero so that the start offset is DWORD aligned. 


The cursor data will be stored as a linear block of data. The active cursor will always be 32x32x2 bits 
in size. Multiple cursor patterns may be loaded into off-screen memory. The start offset is loaded at the 
start of a frame. Each cursor pattern will be exactly 256 bytes in size. Note that if there is a Y offset for 
the cursor pattern, the cursor start offset should be set to point to the first displayed line of the cursor 
pattern. The cursor code for a given pixel is determined by an AND mask and an XOR mask. Each line 
of acursor will be stored as two DWORDs, with each DWORD containing the AND masks for 16 pixels 
in the upper word and the XOR masks for 16 pixels in the lower word. DWORDs will be arranged with 
the leftmost block of 16 pixels being least significant and the rightmost block being most significant. 
Pixels within words will be arranged with the leftmost pixels being most significant and the rightmost 
pixels being least significant. The 2-bit cursor codes are as follows. 


AND XOR Displayed 

0 0 Cursor Color 0 

0 1 Cursor Color 1 

1 0 Transparent — Background Pixel 

1 1 Inverted — Bit-wise Inversion of Background Pixel 


GX_BASE+831Ch-831Fh 


Reserved Default Value = 00000000h 


GX_BASE+8320h-8323h 


DC_VID_ST_OFFSET Register (R/W) Default Value = xxxxxxxxh 


31:21 RSVD Reserved: Set to 0. 
20:0 VID_START | Video Buffer Start Offset Value: This is the value for the Video Buffer Start Offset. It represents the 
_OFFSET starting location for Video Buffer. Bits [3:0] should always be programmed as zero so that the start off- 


set is aligned to a 16 byte boundary. 
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Table 4-32 Display Controller Memory Organization Registers (cont.) 


Bit 


Name 


Description 


GX_BASE+8324h-8327h 


DC_LINE_DELTA Register (R/W) Default Value = xxxxxxxxh 


31:22 RSVD Reserved: Set to 0. 
21:12 CB_LINE__ | Compressed Display Buffer Line Delta: This value represents number of DWORDs that, when 
DELTA added to the starting offset of the previous line, will point to the start of the next compressed line in 
memory. It is used to always maintain a pointer to the starting offset for the compressed display buffer 
line being loaded into the display FIFO. 
11:10 RSVD Reserved: Set to 0. 
9:0 FB_LINE_ | Frame Buffer Line Delta: This value represents number of DWORDs that, when added to the starting 
DELTA offset of the previous line, will point to the start of the next frame buffer line in memory. It is used to 


always maintain a pointer to the starting offset for the frame buffer line being loaded into the display 
FIFO. 


GX_BASE+8328h-832Bh 


DC_BUF_SIZE Register (R/W) Default Value = xxxxxxxxh 


31:30 RSVD Reserved: Set to 0. 
29:16 VID_BUF__ | Video Buffer Size: These bits set the video buffer size, in 64-byte segments. The maximum size is 
SIZE 1MB. 
15:9 CB_LINE_ | Compressed Display Buffer Line Size: This value represents the number of DWORDs for a valid 
SIZE compressed line plus 1. It is used to detect an overflow of the compressed data FIFO. It should never 
be larger than 41h or 65Dh since the maximum size of the compressed data FIFO is 64 DWORDs. 
8:0 FB_LINE_ | Frame Buffer Line Size: This value specifies the number of QWORDS (8-byte segments) to transfer 
SIZE for each display line from the frame buffer. 


If panning is enabled, this value can generally be programmed to the displayed number of QWORDS 
+ 2 so that enough data is transferred to handle any possible alignment. Extra pixel data in the FIFO at 
the end of a line will automatically be discarded. 


GX_BASE+832Ch-832Fh 


Reserved Default Value = 00000000h 
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4.5.11. Timing Registers 


The MediaGX processor timing registers control 
the generation of sync, blanking, and active display 
regions. They provide complete flexibility in inter- 
facing to both CRT and flat panel displays. These 
registers will generally be programmed by the 
BIOS from an INT 10h call or by the extended 
mode driver from a display timing file. Note that the 
horizontal timing parameters are specified in char- 
acter clocks, which actually means pixels divided 
by 8, since all characters are bit mapped. For inter- 
laced display the vertical counter will be incre- 
mented twice during each display line, so vertical 
timing parameters should be programmed with 
reference to the total frame rather than a single 
field. 


The Timing Registers group consists of six 32-bit 
registers located at GX_BASE+8330h-834Ch. 
These registers are described below and Table 4- 
33 gives their bit formats. 


¢ Display Controller Horizontal and Total Timing 
(DC_H_TIMING_1) 
- Contains horizontal active and total timing 
information. 


¢ Display Controller CRT Horizontal Blanking 
Timing (DC_H_TIMING_2 Register) 
- Contains CRT horizontal blank timing infor- 
mation. 


Display Controller CRT Sync Timing 

(DC_H_TIMING_3) 

- Contains CRT horizontal sync timing informa- 
tion. Note, however, that this register should 
also be programmed appropriately for flat 
panel only display since the horizontal sync 
transition determines when to advance the 
vertical counter. 


Display Controller Flat Panel Horizontal Sync 

Timing (DC_FP_H_TIMING) 

- Contains horizontal sync timing information 
for an attached flat panel display. 


Display Controller Vertical and Total Timing 

(DC_V_TIMING_1) 

- Contains vertical active and total timing infor- 
mation. The parameters pertain to both CRT 
and flat panel display. 


Display Controller CRT Vertical Blank Timing 
(DC_V_TIMING_2) 
- Contains vertical blank timing information. 


Display Controller CRT Vertical Sync Timing 

(DC_V_TIMING_3) 

- Contains CRT vertical sync timing informa- 
tion. 


Display Controller Flat Panel Vertical Sync 

Timing (DC_FP_V_TIMING) 

- Contains flat panel vertical sync timing infor- 
mation. 
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Table 4-33 Display Controller Timing Registers 


Bit Name Description 
GX_BASE+8330h-8333h DC_H_TIMING_1 Register (R/W) Default Value = xxxxxxxxh 

31:27 RSVD Reserved: Set to 0. 

26:19 H_TOTAL Horizontal Total: This field represents the total number of character clocks for a given scan line 
minus 1. Note that the value is necessarily greater than the H_ACTIVE field because it includes 
border pixels and blanked pixels. For flat panels, this value will never change. The field [26:16] 
may be programmed with the pixel count minus 1, although bits [18:16] are ignored. The horizon- 
tal total is programmable on 8-pixel boundaries only. 

18:16 RSVD Reserved: These bits are readable and writable but have no effect. 

15:11 RSVD Reserved: Set to 0. 

10:3 H_ACTIVE Horizontal Active: This field represents the total number of character clocks for the displayed 
portion of a scan line minus 1. The field [10:0] may be programmed with the pixel count minus 1, 
although bits [2:0] are ignored. The active count is programmable on 8-pixel boundaries only. Note 
that for flat panels, if this value is less than the panel active horizontal resolution (H_PANEL), the 
parameters H_BLANK_START, H_BLANK_END, H_SYNC_START, and H_SYNC_END should 
be reduced by the value of H_ADJUST (or the value of H_PANEL - H_ACTIVE / 2)to achieve hor- 
izontal centering. 

2:0 RSVD Reserved: These bits are readable and writable but have no effect. 


Note: Note also that for simultaneous CRT and flat panel display the H_-ACTIVE and H_TOTAL parameters pertain to both. 


GX_BASE+8334h-8337h 


DC_H_TIMING_2 Register (R/W) Default Value = xxxxxxxxh 


31:27 RSVD Reserved: Set to 0. 

26:19 H_BLK_END Horizontal Blank End: This field represents the character clock count at which the horizontal 
blanking signal becomes inactive minus 1. The field [26:16] may be programmed with the pixel 
count minus 1, although bits [18:16] are ignored. The blank end position is programmable on 8- 
pixel boundaries only. 

18:16 RSVD Reserved: These bits are readable and writable but have no effect. 

15:11 RSVD Reserved: Set to 0. 

10:3 H_BLK_START | Horizontal Blank Start: This field represents the character clock count at which the horizontal 
blanking signal becomes active minus 1. The field [10:0] may be programmed with the pixel count 
minus 1, although bits [2:0] are ignored. The blank start position is programmable on 8-pixel 
boundaries only. 

2:0 RSVD Reserved: These bits are readable and writable but have no effect. 


Note: A minimum of four character clocks is required for the horizontal blanking portion of a line in order for the timing generator 


to function correctly. 
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Table 4-33 Display Controller Timing Registers (cont.) 


Bit 


Name 


Description 


GX_BASE+8338h-833Bh 


DC_H_TIMING_3 Register (R/W) Default Value = xxxxxxxxh 


31:27 RSVD Reserved: Set to 0. 

26:19 H_SYNC_END | Horizontal Sync End: This field represents the character clock count at which the CRT horizontal 
sync signal becomes inactive minus 1. The field [26:16] may be programmed with the pixel count 
minus 1, although bits [18:16] are ignored. The sync end position is programmable on 8-pixel 
boundaries only. 

18:16 RSVD Reserved: These bits are readable and writable but have no effect. 

15:11 RSVD Reserved: Set to 0. 

10:3 H_SYNC_START | Horizontal Sync Start: This field represents the character clock count at which the CRT horizon- 
tal sync signal becomes active minus 1. The field [10:0] may be programmed with the pixel count 
minus 1, although bits [2:0] are ignored. The sync start position is programmable on 8-pixel 
boundaries only. 

2:0 RSVD Reserved: These bits are readable and writable but have no effect. 


Note: This register should also be programmed appropriately for flat panel only display since the horizontal sync transition deter- 
mines when to advance the vertical counter. 


GX_BASE+833Ch-833Fh 


C_FP_H_TIMING Register (R/W) Default Value = xxxxxxxxh 


31:27 RSVD Reserved: Set to 0. 
26:16 FP_H_SYNC Flat Panel Horizontal Syne End: This field represents the pixel count at which the flat panel hor- 
_END izontal sync signal becomes inactive minus 1. 
15:11 RSVD Reserved: Set to 0. 
10:0 FP_H_SYNC Flat Panel Horizontal Sync Start: This field represents the pixel count at which the flat panel hor- 
_START izontal sync signal becomes active minus 1. 


Note: All values are specified in pixels rather than character clocks to allow precise control over sync position. Note, however, that 
for flat panels which combine two pixels per panel clock, these values should be odd numbers (even pixel boundary) to 
guarantee that the sync signal will meet proper setup and hold times. 


GX_BASE+8340h-8343h 


DC_V_TIMING_1 Register (R/W) Default Value = xxxxxxxxh 


31:27 RSVD Reserved: Set to 0. 

26:16 V_TOTAL Vertical Total: This field represents the total number of lines for a given frame scan minus 1. Note 
that the value is necessarily greater than the V_ACTIVE field because it includes border lines and 
blanked lines. If the display is interlaced, the total number of lines must be odd, so this value 
should be an even number. 

15:11 RSVD Reserved: Set to 0. 

10:0 V_ACTIVE Vertical Active: This field represents the total number of lines for the displayed portion of a frame 


scan minus 1. Note that for flat panels, if this value is less than the panel active vertical resolution 
(V_PANEL), the parameters V_BLANK_START, V_BLANK_END, V_SYNC_START, and 
V_SYNC_END should be reduced by the following value (V_ADJUST) to achieve vertical center- 
ing: V_ADJUST = (V_PANEL - V_ACTIVE) / 2 

If the display is interlaced, the number of active lines should be even, so this value should be an 
odd number. 


Note: All values are specified in lines. 
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Table 4-33 Display Controller Timing Registers (cont.) 


Bit 


Name 


Description 


GX_BASE+8344h-8347h 


DC_V_TIMING_2 Register (R/W) Default Value = xxxxxxxxh 


31:27 RSVD Reserved: Set to 0. 

26:16 V_BLANK_END | Vertical Blank End: This field represents the line at which the vertical blanking signal becomes 
inactive minus 1. If the display is interlaced, no border is supported, so this value should be identi- 
cal to V_TOTAL. 

15:11 RSVD Reserved: Set to 0. 

10:0 V_BLANK_ Vertical Blank Start: This field represents the line at which the vertical blanking signal becomes 
START active minus 1. If the display is interlaced, this value should be programmed to V_ACTIVE plus 1. 


Note: All values are specified in lines. For interlaced display, no border is supported, so blank timing is implied by the total/active 
timing. 


GX_BASE+8348h-834Bh 


DC_V_TIMING_3 Register (R/W) Default Value = xxxxxxxxh 


31:27 RSVD Reserved: Set to 0. 

26:16 V_SYNC_END | Vertical Sync End: This field represents the line at which the CRT vertical sync signal becomes 
inactive minus 1. 

15:11 RSVD Reserved: Set to 0. 

10:0 V_SYNC_START | Vertical Sync Start: This field represents the line at which the CRT vertical sync signal becomes 


active minus 1. For interlaced display, note that the vertical counter is incremented twice during 
each line and since there are an odd number of lines, the vertical sync pulse will trigger in the mid- 
dle of a line for one field and at the end of a line for the subsequent field. 


Note: All values are specified in lines. 


GX_BASE+834Ch-834Fh 


DC_FP_V_TIMING Register (R/W) Default Value = xxxxxxxxh 


31:27 RSVD Reserved: Set to 0. 
26:16 FP_V_SYNG Flat Panel Vertical Sync End: This field represents the line at which the flat panel vertical sync 
_END signal becomes inactive minus 2. Note that the internal flat panel vertical sync is latched by the flat 
panel horizontal sync prior to being output to the panel. 
15:11 RSVD Reserved: Set to 0. 
10:0 FP_VSYNC Flat Panel Vertical Sync Start: This field represents the line at which the internal flat panel verti- 
_START cal sync signal becomes active minus 2. Note that the internal flat panel vertical sync is latched by 


the flat panel horizontal sync prior to being output to the panel. 


Note: All values are specified in lines. 
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4.5.12 Cursor Position Registers 


The Cursor Position Registers contain pixel coordi- 
nate information for the cursor. These values are 
not latched by the timing generator until the start of 
the frame to avoid tearing artifacts when moving 
the cursor. 


The Cursor Position group consists of four 32-bit 
registers located at to GX_BASE+8350h-835Ch. 
These registers are described below and Table 4- 
34 gives their bit formats. 


¢ Display Controller Cursor X Position 
(DC_CURSOR_X) 
- Contains the X position information of the 
hardware cursor. 


Display Controller Vertical Line Count 

(DC_V_LINE_CNT) 

- This register is read only. It provides the 
current scanline for the display. It is used by 
software to time update of the frame buffer to 
avoid tearing artifacts. 


Display Controller Cursor Y Position 

(DC_CURSOR_Y) 

- Contains the Y position information of the 
hardware cursor. 


Display Controller Split-Screen Line Compare 

(DC_SS_LINE_CMP) 

- Contains the line count at which the lower 
screen begins in a VGA split-screen mode. 


Table 4-34 Display Controller Cursor Position Registers 


Bit Name 


Description 


GX_BASE+8350h-8353h 


DC_CURSOR_X Register (R/W) 


Default Value = xxxxxxxxh 


31:16 RSVD Reserved: Set to 0. 


15:11 X_OFFSET | X Offset: This field represents the X pixel offset within the 32x32 cursor pattern at which the displayed 

portion of the cursor is to begin. Normally, this value is set to zero to display the entire cursor pattern, 

but for cursors for which the "hot spot" is not at the left edge of the pattern, it may be necessary to dis- 
play the rightmost pixels of the cursor only as the cursor moves close to the left edge of the display. 


10:0 CURSOR_X | Cursor X: This field represents the X coordinate of the pixel at which the upper left corner of the cur- 
sor is to be displayed. This value is referenced to the screen origin (0,0) which is the pixel in the upper 
left corner of the screen. 


GX_BASE+8354h-8357h 


31:11 RSVD Reserved (Read Only) 

10:0 V_LINE_CNT | Vertical Line Count (Read Only): This value is the current scanline of the display. 

(RO) 

Note: The value in this register is driven directly off of the DOTCLK, and consequently it is not synchronized with the CPU clock. 
Software should read this register twice and compare the result to ensure that the value is not transitioning. 


DC_V_LINE_CNT Register (RO) Default Value = xxxxxxxxh 
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Table 4-34 Display Controller Cursor Position Registers (cont.) 


Bit Name Description 
GX_BASE+8358h-835Bh DC_CURSOR_Y Register (R/W) Default Value = xxxxxxxxh 
31:16 RSVD Reserved: Set to 0. 


15:11 Y_OFFSET | Y Offset: This field represents the Y line offset within the 32x32 cursor pattern at which the displayed 
portion of the cursor is to begin. Normally, this value is set to zero to display the entire cursor pattern, 
but for cursors for which the "hot spot" is not at the top edge of the pattern, it may be necessary to dis- 
play the bottommost lines of the cursor only as the cursor moves close to the top edge of the display. 
Note that if this value is nonzero, the CUR START OFFSET must be set to point to the first cursor 
line to be displayed. 

10 RSVD Reserved: Set to 0. 

9:0 CURSOR_Y | Cursor Y: This field represents the Y coordinate of the line at which the upper left corner of the cursor 
is to be displayed. This value is referenced to the screen origin (0,0) which is the pixel in the upper left 
corner of the screen. 

This field is alternately used as the line-compare value for a newly-programmed frame buffer start off- 
set. This is necessary for VGA programs that change the start offset in the middle of a frame. In order 
to use this function, the hardware cursor function should be disabled. 


GX_BASE+835Ch-835Fh DC_SS_LINE_CMP Register (R/W) Default Value = xxxxxxxxh 
31:11 RSVD Reserved: Set to 0. 
10:0 SS_LINE_C | Split-Screen Line Compare: This is the line count at which the lower screen begins in a VGA split- 
MP screen mode. 


Note: When the internal line counter hits this value, the frame buffer address is reset to 0. This function is enabled with the SSLC 
bit in the DC_GENERAL_CFG register. 
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4.5.13 Color Registers These registers are described below and Table 4- 


These registers are used in 8 BPP display mode 35 gives their bit formats. 


with an external RAMDAC for passing cursor and * Display Controller Cursor Color 

border color indices to the palette in the RAMDAC. (DC_CURSOR_COLOR) 

For the flat panel color translation, the cursor and “ i 

border color data is loaded into palette extensions 

as described in the Palette Access Registers 

section. * Display Controller Border Color 

(DC_BORDER_COLOR) 

- Contains the 8-bit index for the border or 
overscan color. 


- Contains the 8-bit indices for the cursor 
colors. 


The Color Registers group consists of two 32-bit 
registers located at GX_BASE+8360h-8368h. 


Table 4-35 Display Controller Color Registers 


Bit Name Description 
GX_BASE+8360h-8363h DC_CURSOR_COLOR Register (R/W) Default Value = xxxxxxxxh 
31:16 RSVD Reserved: Set to 0. 
15:8 CURS_CLR_1 Cursor Color 1: This is the 8-bit index to the external palette for the cursor color 1. It should point 
to a reserved or static color. 
7:0 CURS_CLR_0O | Cursor Color 0: This is the 8-bit index to the external palette for the cursor color 0. It should point 
to a reserved or static color. 


GX_BASE+8364h-8367h Reserved Default Value = 00000000h 
GX_BASE+8368h-836Bh DC_BORDER_COLOR Register (RO) Default Value = xxxxxxxxh 
31:8 RSVD Reserved: Set to 0. 
7:0 BORDER_CLR_| Border Color: This is the 8-bit index to the external palette for the border color. It should point to a 
reserved or static color. 


GX_BASE+836Ch-836Fh Reserved Default Value = 00000000h 
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These registers are used for accessing the internal 
palette RAM and extensions. In addition to the 
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Palette Access Registers * Display Controller Palette Data (DC_PAL DATA) 


- Contains the data for a palette access cycle. 


* Display Controller Display FIFO Diagnostic 


standard 256 entries for 8 BPP color translation, (DC_DFIFO_ DIAG) 


the MediaGX processor palette has extensions for 
cursor colors and overscan (border) color. 


The Palette Access Register group consists of four 
32-bit registers located at GX_BASE+8370h- 
837Ch. These registers are described below and 
Table 4-36 gives their bit formats. 


- This register is provided to enable testability 
of the Display FIFO RAM. 


¢ Display Controller Compression FIFO Diag- 
nostic (DC_CFIFO_DIAG) 
- This register is provided to enable testability 
of the Compressed Line Buffer (FIFO) RAM. 


* Display Controller Palette Address 


(DC_ 


PAL_ADDRESS) 


- This register should be written with the 
address (index) location to be used for the 
next access to the DC_PAL_DATA register. 


Table 4-36 Display Controller Palette and RAM Diagnostic Registers 


Bit 


Name 


Description 


GX_BASE+8370h-8373h 


DC_PAL_ADDRESS Register (R/W) Default Value = xxxxxxxxh 


31:9 


RSVD 


Reserved: Set to 0. 


8:0 


PALETTE_ADDR 


Palette Address: This 9-bit field specifies the address to be used for the next access to the 
DC_PAL_DATA register. Each access to the data register will automatically increment the palette 
address register. If non-sequential access is made to the palette, the address register must be 
loaded between each non-sequential data block. The address ranges are as follows. 


Address Color 

Oh - FFh Standard Palette Colors 
100h Cursor Color 0 

101h Cursor Color 1 

102h Reserved 

103h Reserved 

104h Overscan Color 

105h - 1FFh Not Valid 


Note that in general, 18-bit values will be loaded for all color extensions. However, if a 16 BPP 
mode is active, only the appropriate most significant bits will be used (5-5-5 or 5-6-5). If an 8 BPP 
display mode is active and an external RAMDAC is used, the cursor index will be obtained from 
the DC_CURSOR_COLOR register. The border index will be obtained from the 
DC_BORDER_COLOR register. 


GX_BASE+8374h-8377h 


DC_PAL_DATA Register (R/W) Default Value = xxxxxxxxh 


31:18 RSVD 


Reserved: Set to 0. 


17:0 


PALETTE_DATA 


Palette Data: This 18-bit field contains the read or write data for a palette access. 


Note: When a read or write to the palette RAM occurs, the previous output value will be held for one additional DOTCLK period. 
This effect should go unnoticed and will provide for sparkle-free update. Prior to a read or write to this register, the 
DC_PAL_ADDRESS register should be loaded with the appropriate address. The address automatically increments after 
each access to this register, so for sequential access, the address register need only be loaded once 
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Table 4-36 Display Controller Palette and RAM Diagnostic Registers (cont.) 


Bit Name Description 
GX_BASE+8378h-837Bh DC_DFIFO_DIAG Register (R/W) Default Value = xxxxxxxxh 
31:0 DISPLAY FIFO | Display FIFO Diagnostic Read or Write Data: Before this register is accessed, the DIAG bit in 
DIAGNOSTIC DC_GENERAL_CFG register should be set high and the DFLE bit should be set low. Since, each 
DATA FIFO entry is 64 bits, an even number of write operations should be performed. Each pair of write 


operations will cause the FIFO write pointer to increment automatically. After all write operations 
have been performed, a single read of don't care data should be performed to load data into the 
output latch. Each subsequent read will contain the appropriate data which was previously written. 
Each pair of read operations will cause the FIFO read pointer to increment automatically. A pause 
of at least four core clocks should be allowed between subsequent read operations to allow ade- 
quate time for the shift to take place. 


GX_BASE+837Ch-837Fh DC_CFIFO_DIAG Register (R/W) Default Value = xxxxxxxxh 


31:0 COMPRESSED | Compressed Data FIFO Diagnostic Read or Write Data: Before this register is accessed, the 
FIFO DIAGNOS- | DIAG bit in DC_GENERAL_CFG register should be set high and the DFLE bit should be set low. 
TIC DATA Also, the DIAG bit in DC_OUTPUT_CFG should be set high and the CFRW bit in 

DC_OUTPUT_CFG should be set low. After each write, the FIFO write pointer will automatically 
increment. After all write operations have been performed, the CFRW bit of DC_OUTPUT_CFG 
should be set high to enable read addresses to the FIFO and a single read of don't care data 
should be performed to load data into the output latch. Each subsequent read will contain the 
appropriate data which was previously written. After each read, the FIFO read pointer will auto- 


matically increment. 
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4.5.15 Cx5520/Cx5530 Display RAMDAC is eliminated. The Cx5520/Cx5530 
Controller Interface contains the DACs, a video accelerator engine, 


As previously stated in Section 1.3 “System anole. TE Lanlertaces 


Designs” on page 7, the MediaGX processor can A MediaGxX processor and Cx5520/Cx5530-based 


interface with either the Cx5520 or CxS530 1/0 system supports both portable and desktop config- 
Companion chip. This section will discuss the urations. Figure 4-16 shows the signal connections 
specifics on signal connections between the two for both types of systems. 


devices with regards to the display controller. 


When the MediaGX processor is used in a system 
with the Cx5520/Cx5530, the need for an external 


Portable 
Configuration 


FP_ENA VDD 
MediaGx FP_ENA_BKL 


Processor FP_DISP_ENA_OUT 


FP_HSYNC 
PCLK PCLK FP_VSYNC 
VID_CLK VID_CLK FP_CLK 
DCLK DCLK 
reuse FS] PEC | ona 
ENA _DISP FP_DISP_ENA FP_DATA[11:16] 
VID_RDY VID_RDY FP_DATA[5:0] 
VID_DATAI7:0] VID_DATA|7:0] 
PIXEL[17:12] PIXEL[23:18]}" 


PIXEL[11:6] PIXEL[15:10}* 
PIXEL[5:0] PIXEL(7:2]* HSYNG_OUT. Pin 13 Pin3 


VID_VAL VID_VAL VSYNC_OUT Pin 14 Pin2 


CRT_HSYNC HSYNG Pin 1 
CRT_VSYNC VSYNG 


VGA 
DDC_SCL Pin 15 Port 
DDC_SDA Pin 12 


Cx5520/Cx5530 
1/0 Companion 


IOUTR 
IOUTG 
IOUTB 


Note: *Connect PIXEL[17:16] PIXEL[9:8], and PIXEL[1:0] on the Cx5520 to ground. 


Figure 4-16 Display Controller Signal Connections 
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4.5.15.1 Cx5520/Cx5530 Video Port VID_DATAJ7:0] is advanced when both VID_VAL 
Data Transfer and VID_RDY are asserted. VID_RDY is driven 


decal ,; one clock early to the MediaGX processor while 
VID_VAL indicates that the MediaGX processor Pir ae ; 
has placed valid:data on VID_DATAIZ-0]. VIDSRDY: = oven colneidentt with VIDADATA 0]. 
indicates that the Cx5520/Cx5530 is ready to A sample interface functional timing diagram is 
shown in Figure 4-17. 
accept the next byte of video data. 


| | 
VID_VAL 8 CLKs | 8 + 3ICLKs 
| | 


VID_RDY | 3 CLKs | 
VID_DATA 
~ 17:0) X 4CLKs 


CLK CLKs CLK CLKs CLKs 


Note: VID_CLK = CORE_CLK/2 


Figure 4-17 Video Port Data Transfer (Cx5520/Cx5530) 
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4.6 PCI Controller 


The MediaGX processor includes an integrated 
PCI controller with the following features. 


4.6.1 X-Bus PCI Slave 

* 16-byte PCI write buffer 

* 16-byte PCI read buffer from X-bus 

* Supports cache line bursting 

¢ Write/Inv line support 

¢ Pacing of data for read or write operations with 
X-bus 

¢ No active byte enable transfers supported 


4.6.2 X-Bus PCI Master 

* 16 byte X-bus to PCI write buffer 

* Configuration read/write Support 

« Int Acknowledge support 

« Lock conversion 

* Support fast back-to-back cycles as slave 


4.6.3 PCI Arbiter 

¢ Fixed, rotating, hybrid, or ping-pong arbitration 
(programmable) 

¢ Support four masters, three on PCI 

¢ Internal REQ for CPU 

¢ Master retry mask counter 

¢ Master dead timer 

« Resource or total system lock support 


PCI Controller 


4.6.4 Generating Configuration 


Cycles 


Configuration space is a physical address space 
unique to PCI. Configuration Mechanism #1 must 
be used by software to generate configuration 
cycles. Two DWORD I/O locations are used in this 
mechanism. The first DWORD location (CF8h) 
references a read/write register that is named 
CONFIG_ADDRESS. The second DWORD 
address (CFCh) references a register named 
CONFIG_DATA. The general method for accessing 
configuration space is to write a value into 
CONFIG_ADDRESS that specifies the PCI bus, 
device on that bus, and configuration register in 
that device being accessed. A read or write to 
CONFIG_DATA will then cause the bridge to trans- 
late that CONFIG_ADDRESS value to the 
requested configuration cycle on the PCI bus. 


4.6.5 Generating Special Cycles 


A special cycle is a broadcast message to the PCI 
bus. Two hardcoded special cycle messages are 
defined in the command encode: HALT and SHUT- 
DOWN. Software can also generate special cycles 
by using special cycle generation for configuration 
mechanism #1 as described in the PCI Specifica- 
tion 3.6.4.1.2 and briefly described here. To initiate 
a special cycle from software, the host must write a 
value to CONFIG_ADDRESS encoded as shown 
in Table 4-37. 


The next value written to CONFIG_DATA is the 
encoded special cycle. Type 0 or Type 1 conver- 
sion will be based on the Bus Bridge number 
matching the MediaGX processor’s bus number of 
OOh. 


Table 4-37 Special-Cycle Code to CONFIG_ADDRESS 


31 30 24 23 |22/21|20]19|18/17/16|15|14/13/12/11/10| 9| 8|7|/6)/5/4]3]2]1]| 0 
1 dddddqdd Bus No. = Bridge fe [ae te Tae at al TA |e 0s |] Ou | 0 

CONFIG_EN | RSVD BUS NUMBER DEVICE NUMBER | FUNCTION | REGISTER NUMBER | TRANS 

NUMBER LATION 

TYPE 
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4.6.6 PCI Configuration Space accesses are permitted to this register all others 
Control Registers will be forwarded as normal I/O cycles to the PCI 
bus. 


There are two registers in this category: 
CONFIG_ADDRESS and CONFIG_DATA. The CONFIG_DATA register contains the data that 


The CONFIG_ADDRESS register contains the is sent or received during a PCI configuration 
; space access. 
address information for the next configuration 


space access to CONFIG_DATA. Only DWORD Table 4-38 gives the bit formats for these two regis- 
ters. 


Table 4-38 PCI Configuration Registers 


Bit Name Description 
1/O Offset OCF8h-OCFBh CONFIG_ADDRESS Register (R/W) Default Value = 00000000h 
31 GFC_EN CONFIG ENABLE: Determines when accesses should be translated to configuration cycles on 


the PCI bus, or treated as a normal I/O operation. This register will be updated only on full 
DWORD I/O operations to the CONFIG_ADDRESS. Any other accesses are treated as normal I/O 
cycles in order to allow I/O devices to use BYTE or WORD registers at the same address an 
remain unaffected. Once bit 31 is set high, subsequent accesses to CONFIG_DATA are then 
translated to configuration cycles. 


30:24 RSVD Reserved: Set to 0. 
23:16 BUS Bus: Specifies a PCI bus number in the hierarchy of 1 to 256 buses. 
15:11 DEVICE Device: Selects a device on a specified bus. A device value of 00h will select the MediaGX pro- 


cessor if the bus number is also 00h. DEVICE values of 01h to 15h will be mapped to AD[31:11], 
so only 21 of the 32 possible devices are supported. A DEVICE value of 00001b will map to 
AD[11] while a device of 10101b will map to AD[31]. 


10:8 FUNCTION Function: Selects a function in a multi-function device. 
7:2 REGISTER Register: Chooses a configuration space register in the selected device. 
1:0 TT Translation Type Bits: These bits indicate if the configuration access is local or one that requires 


translation through other bridges to another PCI bus. When an access occurs to the 
CONFIG_DATA address and the specified bus number matches the MediaGX processor’s bus 
number (00h), then a Type 0 translation takes place. 

For a Type 0 translation, the CONFIG_ADDRESS register values are translated to AD lines on the 
PCI bus. Note that bits 10:2 are passed unchanged. The DEVICE value is mapped to one of 21 
AD lines. The translation type bits are set to 00 to indicate a transaction on the local PCI bus. 
When an access occurs to the CONFIG_DATA address and the specified bus number is not 00h 
(Type 1), the MediaGX processor passes this cycle to the PCI bus by copying the contents of the 
CONFIG_ADDRESS register onto the AD lines during the address phase of the cycle while driving 
the translation type bits AD[1:0] to 01. Note that the MediaGX processor and Cx5520 system does 
not support Type 1 transfers. 


/O Offset OCFCh-OCFFh CONFIG_DATA (R/W) Default Value = 00000000h 


31:0 CONFIG_DATA | Configuration Data Register: Contains the data that is sent or received during a PCI configura- 
tion space access. The register accessed is determined by the value in the CONFIG_ADDRESS 
register. The CONFIG_DATA register supports BYTE, WORD, or DWORD accesses. To access 
this register, bit 31 of the CONFIG_ADDRESS register must be set to 0 and a full DWORD I/O 
access must be done. Configuration cycles are performed when bit 31 of the CONFIG_ADDRESS 
register is set to 1 
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4.6.7 PCI Configuration Space I/O write to Port OCF8h. Also, when entering the 
Registers Configuration Index, only the six most significant 
bits of the offset are used, and the two least signifi- 


To access the internal PCI configuration registers Gant bis muskbe-00b. 


of the MediaGX processor, the Configuration 
Address Register (CONFIG_ADDRESS) must be Table 4-40 summarizes the registers located within 


written as a DWORD using the format shown in the Configuration Space. The tables that follow, 
Table 4-39. Any other size will be interpreted as an give detailed register/bit formats. 


Table 4-39 Format for Accessing the Internal PCI Configuration Registers 


31 | 30 | 29 | 28 | 27 | 26 | 25 | 24 | 23 | 22 | 21 | 20) 19) 18) 17) 16/15) 14)13)12)11/;10;9|/8)7);6;/5;/4);3);2)]1/0 
1 RESERVED 0/;0/;0;0};0;0;/0/0;0]0/0/0);0j)j0{]0/ 0] Configuration Index | 0 


Table 4-40 PCI Configuration Space Register Summary 


Index Type Name Default Value 
00h-01h RO Vendor Identification 1078h 
02h-03h RO Device Identification 0001h 
04h-05h R/W PCI Command 0007h 
06h-07h R/W Device Status 0280h 
08h RO Revision Identification 00h 
09h-OBh RO Class Code 060000h 
0Ch RO Cache Line Size 00h 
ODh R/W Latency Timer ODh 
OEh-3Fh + Reserved 00h 
40h R/W PCI Control Function 1 00h 
41h R/W PCI Control Function 2 96h 
42h -- Reserved 00h 
43h R/W PCI Arbitration Control 1 80h 
44h R/W PCI Arbitration Control 2 00h 
45h-FFh -- Reserved 00h 
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Table 4-41 PCI Configuration Registers 


Bit Name Description 
Index 00h-01h Vendor Identification Register (RO) Default Value = 1078h 
31:0 VID (RO) Vendor Identification Register (Read Only): The combination of this value and the device ID 
uniquely identifies any PCI device. The Vendor ID is the ID given to Cyrix Corporation by the PCI SIG. 


Index 02h-03h Device Identification Register (RO) Default Value = 0001h 


31:0 DID (RO) Device Identification Register (Read Only): This value along with the vendor ID uniquely identifies 
any PCI device. 


Index 04h-05h PCI Command Register (R/W) Default Value = 0007h 
15:10 RSVD Reserved: Set to 0. 
9 FBE Fast Back-to-Back Enable: As a master, the MediaGX processor does not support this function. 
This bit returns 0. 
8 SERR SERR# Enable: This is used as an output enable gate for the SERR# driver. 
WAT Wait Cycle Control: MediaGX processor does not do address/ data stepping. 


This bit is always set to 0. 


6 PE Parity Error Response: 
0 = MediaGX processor ignores parity errors on the PCI bus. 
1 = MediaGX processor checks for parity errors. 


5 VPS VGA Palette Snoop: MediaGX processor does not support this function. 
This bit is always set to 0. 
4 MS Memory Write and Invalidate Enable: As a master, the MediaGX processor does not support this 
function. 
This bit is always set to 0. 
3 SPC Special Cycles: MediaGX processor does not respond to special cycles on the PCI bus. 
This bit is always set to 0. 
2 BM Bus Master: 


0 = MediaGX processor does not perform master cycles on the PCI. 
1 = MediaGX processor can act as a bus master on the PCI. 


1 MS Memory Space: MediaGX processor will always respond to memory cycles on the PCI. 
This bit is always set to 1. 
0 lOS 1/O Space: MediaGX processor will not respond to I/O accesses from the PCI. 


This bit is always set to 1. 


Index 06h-07h PCI Device Status Register (RO, R/W Clear) Default Value = 0280h 
15 DPE Detected Parity Error: When a parity error is detected, this bit is set to 1. 
This bit can be cleared to 0 by writing a 1 to it. 
14 SSE Signaled System Error: This bit is set whenever SERR# is driven active. 
13 RMA Received Master Abort: This bit is set whenever a master abort cycle occurs. A master abort will 


occur whenever a PCI cycle is not claimed except for special cycles. 
This bit can be cleared to 0 by writing a 1 to it. 


12 RTA Received Target Abort: This bit is set whenever a target abort is received while the MediaGX proces- 
sor is master of the cycle. 


This bit can be cleared to 0 by writing a 1 to it. 
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Table 4-41 PCI Configuration Registers (cont.) 


Bit Name Description 


11 STA Signaled Target Abort: This bit is set whenever the MediaGX processor signals a target abort. A tar- 
get abort is signaled when an address parity occurs for an address that hits in the MediaGX proces- 
sor’s address space. 


This bit can be cleared to 0 by writing a 1 to it. 


10:9 DT Devise Timing: 
00 = Fast 
01 = Medium 
10 = Slow 


11 = Reserved 

The MediaGX processor performs medium DEVSEL# active for addresses that hit into the MediaGX 
processor address space. These two bits are always set to 01. 

8 DPD Data Parity Detected: This bit is set when three conditions are met. 

1) MediaGX processor asserted PERR# or observed PERR# asserted; 

2) MediaGX processor is the master for the cycle in which the PERR# occurred; and 

3) PE (bit 6 of Command Register) is enabled. 

This bit can be cleared to 0 by writing a 1 to it. 


7 FBS Fast Back-to-Back Capable: As a target, the processor is capable of accepting Fast Back-to-Back 
transactions. 


This bit is always set to 1. 


6:0 RSVD Reserved: Set to 0. 
Index 08h Revision Identification Register (RO) Default Value = 00h 
7:0 RID (RO) Revision ID (Read Only): This register contains the revision number of the MediaGX design. 
Index 09h-OBh Class Code Register (RO) Default Value = 060000h 
23:16 CLASS Class Code: The class code register is used to identify the generic function of the device. The 


MediaGxX processor is classified as a host bridge device (06). 
15:0 RSVD (RO) | Reserved (Read Only) 


Index 0Ch Cache Line Size Register (RO) Default Value = 00h 


7:0 CACHELINE | Cache Line Size (Read Only): The cache line size register specifies the system cacheline size in 
units of 32-bit words. This function is not supported in the MediaGX Processor. 


Index ODh Latency Timer Register (R/W) Default Value = 00h 
7:5 RSVD Reserved: Set to 0. 
4:0 LAT_TIMER | Latency Timer: The latency timer as used in this implementation will prevent a system lockup result- 


ing from a slave the does not responded to the master. If the register value is set to 00h, the timer is 
disabled. Otherwise, Timer represents the 5 MSBs of an 8-bit counter. The counter will reset on each 
valid data transfer. If the counter expires before the next TRDY# is received active, then the slave is 
considered to be incapable of responding, and the master will stop the transaction with a master abort 
and flag an SERR# active. This would also keep the master from being retried forever by a slave 
device that continues to issue retries. In these cases, the master will also stop the cycle with a master 
abort. 


Index OEh-3Fh Reserved Default Value = 00h 
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Table 4-41 


PCI Configuration Registers (cont.) 


Bit 


Name 


Description 


Index 40h 


PCI Control Function 1 Register (R/W) Default Value = 00h 


7 


RSVD 


Reserved: Set to 0. 


6 


SW 


Single Write Mode: PCI slave supports: 
0 = Multiple PCI write cycles 


1 = Single cycle write transfers on the PCI bus. The slave will perform a target disconnect with the first 
data transferred. 


SR 


Single Read Mode: PCI slave supports: 
0 = Multiple PCI read cycles. 


1 = Single cycle read transfers on the PCI bus. The slave will perform a target disconnect with the first 
data transferred. 


RXBNE 


Force Retry when X-Bus Buffers are Not Empty: 


0 = PCI slave accepts the PCI cycle with data in the PCI master write buffers. The data in the PCI 
master write buffers will not be affected or corrupted. The PCI master holds request active indicating 
the need to access the PCI bus. 


1 = PCI slave retries cycles if the PCI master X-bus write buffers contain buffered data. 


SWBE 


PCI Slave Write Buffer Enable: PCI slave write buffers: 0 = Disable; 1 = Enable. 


CLRE 


PCI Cache Line Read Enable: Read operations from the PCI into the MediaGX processor: 


0 = Single cycle unless a read multiple or memory read line command is used. 
1 = Cause a cache line read to occur. 


XBE 


X-Bus Burst Enable: PCI slave acting as a master performs burst cycles on the X-bus on write-back 
invalidate cycles from the PCI. 0 = Disable; 1 = Enable. 


(This bit does not control read bursting; bit 2 does.) 


RSVD 


Reserved — Should return a value of 0. 


Index 41h 


PCI Control Function 2 Register (R/W) Default Value = 96h 


RSVD 


Reserved: Set to 0. 


RW_CLK 


RAW Clock: A debug signal used to view internal clock operation. 0 = Disable; 1 = Enable. 


PFS 


PERR# forces SERR#: PCI master drives an active SERR# anytime it also drives or receives an 
active PERR#: 0 = Disable; 1 = Enable. 


XWB 


X-Bus to PCI Write Buffer: Enable MediaGX processor PCI master’s X-Bus write buffers (non-locked 
memory cycles are buffered, I/O cycles and lock cycles are not buffered): 0 = Disable; 1 = Enable. 


3:2 


SDB 


Slave Disconnect Boundary: PCI slave issues a disconnect with data when it crosses line boundary: 
00 = 128 bytes 

01 = 256 bytes 

10 = 512 bytes 

11 = 1024 bytes 


Works in conjunction with bit 1. 


SDBE 


Slave Disconnect Boundary Enable: 
0 = PCI slave disconnects on boundaries set by bits [3:2]. 
1 = PCI disconnects on cache line boundary which is 16 bytes. 


XWS 


X-Bus Wait State Enable: The PCI slave acting as a master on the X-bus will insert wait states on 
write cycles for data setup time. 0 = Disable; 1 = Enable. 
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Table 4-41 PCI Configuration Registers (cont.) 


Bit Name Description 
Index 43h PCI Arbitration Control 1 Register (R/W) Default Value = 80h 
7 BG Bus Grant: 


0 = Grants bus regardless of X-bus buffers. 
1 = Grants bus only if X-bus buffers are empty. 


RSVD Reserved: Set to 1. 


RME2 REQ2# Retry Mask Enable: Arbiter allows the REQ2# to be masked based on the master retry mask 
in bits [2:1]: 0 = Disable; 1 = Enable. 


4 RME1 REQ1# Retry Mask Enable: Arbiter allows the REQ1# to be masked based on the master retry mask 
in bits [2:1]: 0 = Disable; 1 = Enable. 

3 RMEO REQO# Retry Mask Enable: Arbiter allows the REQO# to be masked based on the master retry mask 
in bits [2:1]: 0 = Disable; 1 = Enable. 

2:1 MRM Master Retry Mask: When a target issues a retry to a master, the arbiter can mask the request from 


the retried master in order to allow other lower order masters to gain access to the PCI bus: 


00 = No retry mask 

01 = Mask for 16 PCI clocks 
10 = Mask for 32 PCI clocks 
11 = Mask for 64 PCI clocks 


0 HXR Hold X-bus on Retries: Arbiter holds the X-Bus X_HOLD for 2 additional clocks to see if the retried 
master will request the bus again: 0 = Disable; 1 = Enable 


(This may prevent retry thrashing in some cases.) 


Index 44h PCI Arbitration Control 2 Register (R/W) Default Value = 00h 


7 PP Ping-Pong: 
0 = Arbiter grants the processor bus per the setting of bits [2:0]. 
1 = Arbiter grants the processor bus ownership of the PCI bus every other arbitration cycle. 


6:4 FAC Fixed Arbitration Controls: These bits control the priority under fixed arbitration. The priority table is 

as follows (priority listed highest to lowest): 

000 = REQO#, REQ1#, REQ2# 

001 = REQ1#, REQO0#, REQ2# 

010 = REQO#, REQ2#,REQ1# 

011 = Reserved 

100 = REQ1#, REQ2#, REQO# 

101 = Reserved 

110 = REQ2#, REQ1#, REQO# 

111 = REQ2#, REQ0#, REQ1# 

Note: The rotation arbitration bits [2:0] must be set to 000 for full fixed arbitration. If rotation bits are 
not set to 000, then hybrid arbitration will occur. If Ping-Pong is enabled (bit 7 = 1), the proces- 
sor will have priority every other arbitration. In this mode, the arbiter grants the PCI bus to a 
master and ignores all other requests. When the master finishes, the processor will be guaran- 
teed access. At this point PCI requests will again be recognized. This will switch arbitration 
from CPU-to-PCl to CPU-to-PCl, etc. 


3 RSVD Reserved: Set to 0. 
2:0 RAC Rotating Arbitration Controls: These bits control the priority under Rotating arbitration. 


000 = Fixed arbitration will occur. 
111 = Full rotating arbitration will occur. 


When these bits are set to other values, hybrid arbitration will occur. 
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4.6.8 PCI Cycles 


The following sections and diagrams provide the 
functional relationships for PCI cycles. 


4.6.8.1 PCI Read Transaction 


A PCI read transaction consists of an address 
phase and one or more data phases. Data phases 
may consist of wait cycles and a data transfer. 
Figure 4-18 illustrates a PCI read transaction. In 
this example, there are three data phases. 


The address phase begins on clock 2 when 
FRAME# is asserted. During the address phase, 
AD[31:0] contains a valid address and C/BE[3:0]# 


contains a valid bus command. The first data 
phase begins on clock 3. During the data phase, 
AD[81:0] contains data and C/BE[3:0]# indicate 
which byte lanes of AD[81:0] carry valid data. The 
first data phase completes with zero delay cycles. 
However, the second phase is delayed one cycle 
because the target was not ready so it deasserted 
TRDY# on clock 5. The last data phase is delayed 
one cycle because the master deasserted IRDY# 
on clock 7. 


For additional information refer to Chapter 3.3.1, 
Read Transaction, of the PCI Local Bus Specifica- 
tion, Revision 2.1. 
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4.6.8.2 PCI Write Transaction 


A PCI write transaction is similar to a PCI read 
transaction, consisting of an address phase and 
one or more data phases. Since the master 
provides both address and data, no turnaround 
cycle is required following the address phase. The 
data phases work the same for both read and write 
transactions. Figure 4-19 illustrates a write transac- 
tion. 


The address phase begins on clock 2 when 
FRAME is asserted. The first and second data 
phases complete without delays. During data 
phase 3, the target inserts three wait cycles by 
deasserting TRDY#. 


For additional information refer to Chapter 3.3.2, 
Write Transaction, of the PCI Local Bus Specifica- 
tion, Revision 2.1. 
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Figure 4-19 Basic Write Operation 


Page 186 Cyrix Corporation Confidential GXm_db_v2.0 


PCI Controller A | 


4.6.8.3. PCI Arbitration 


An agent requests the bus by asserting its REQ#. 
Based on the arbitration scheme set in the PCI 
Arbitration Control 2 Register (Index 44h), the GX 
PCI arbiter will grant the request by asserting 
GNT#. Figure 4-20 illustrates basic arbitration. 


REQ#-a is asserted at clock 1. The PCI MediaGX 
processor arbiter grants access to Agent A by 
asserting GNT#-a on clock 2. Agent A must begin 
a transaction by asserting FRAME# within 16 
clocks, or the GX PCI arbiter will remove GNT#. 
Also, it is possible for Agent A to lose bus owner- 
ship sooner if another agent with higher priority 
requests the bus. However, in this example, Agent 
A starts the transaction on clock 3 by asserting 
FRAME# and completes its transaction. Since 


Agent A requests another transaction, REQ#-a 
remains asserted. When FRAME is asserted on 
clock 3, the MediaGX processor’s PCI arbiter 
determines Agent B should go next, asserts GNT#- 
b and deasserts GNT#-a on clock 4. Agent B 
requires only a single transaction. It completes the 
transaction, then deasserts FRAME# and REQ#-b 
on clock 6. The MediaGX processor’s PCI arbiter 
can then grant access to agent A, and does so on 
clock 7. Note that all buffers must flush before a 
grant is given to a new agent. 


For additional information refer to Chapter 3.4.1, 
Arbitration Signaling Protocol, of the PCI Local Bus 
Specification, Revision 2.1. 
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4.6.8.4 PCI Halt Command 


Halt is a broadcast message from the processor 
indicating it has executed a halt instruction. The 
PCI Special Cycle command is used to broadcast 
the message to all agents on the bus segment. 
During the address phase of the Halt Special cycle, 
C/BE[3:0]# = 0001 and AD[31:0] are driven to 
random values. During the data phase, C/BE[3:0]# 
= 1100 indicating bytes 1 and 0 are valid and 
AD[15:0] = 0001h. 


For additional information, refer to Chapter 3.7.2, 
Special Cycle, and Appendix A, Special Cycle 
Messages, of the PCI Local Bus Specification, 
Revision 2.1. 
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5 Virtual Subsystem Architecture 


This section describes the Cyrix Virtual Subsystem 
Architecture™ (VSA™) as implemented with the 
MediaGX processor(s) and Cyrix VSA enhanced 
I/O Companion device(s). VSA provides a frame- 
work to enable software implementation of tradi- 
tionally hardware-only components. VSA software 
executes in System Management Mode (SMM), 
enabling it to execute transparently to the oper- 
ating system, drivers and applications. 


The VSA design is based upon a simple model for 
replacing hardware components with software. 
Hardware to be virtualized is merely replaced with 
simple access detection circuitry which asserts the 
processor’s SMI# (System Management Interrupt) 
pin when hardware accesses are detected. The 
current execution stream is immediately 
preempted, and the processor enters SMM. The 
SMM system software then saves the processor 
state, initializes the VSA execution environment, 
decodes the SMI source and dispatches handler 
routines which have registered requests to service 
the decoded SMI source. Once all handler routines 
have completed, the processor state is restored 
and normal execution resumes. In this manner, 
hardware accesses are transparently replaced with 
the execution of SMM handler software. 


Historically, SMM software was used primarily for 
the single purpose of facilitating active power 
management for notebook designs. That soft- 
ware’s only function was to manage the power up 
and down of devices to save power. With high 
performance processors now available, it is 
feasible to implement, primarily in SMM software, 
PC capabilities traditionally provided by hardware. 
In contrast to power management code, this virtu- 
alization software generally has strict performance 
requirements to prevent application performance 
from being significantly impacted. 


Several functions can be virtualized in a MediaGX 
processor based design using the VSA environ- 
ment. The VSA enhanced chipsets provide 
programmable resources to trap both memory and 
I/O accesses. However, specific hardware is 
included to support the virtualization of VGA core 
compatibility and audio functionality in the system. 


The hardware support for VGA emulation resides 
completely inside the MediaGX processor. Legacy 
VGA accesses do not generate off-chip bus cycles. 
However, the VSA support hardware for 
XpressAUDIO™ resides in the I/O Companion 
device (i.e., Cx5520 and Cx5530) and is described 
in their respective specification(s). 
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5.1 Virtual VGA 


The MediaGX processor reduces the burden of 
PC- legacy hardware by using a balanced mix of 
hardware and software to provide the same func- 
tionality. The graphics pipeline contains full hard- 
ware support for the VGA “front-end”, the logic that 
controls read and write operations to the VGA 
frame buffer (located in graphics memory). For 
some modes, the hardware can also provide direct 
display of the data in the VGA buffer. Virtual VGA 
traps frame buffer accesses only when necessary, 
but it must trap all VGA I/O accesses to maintain 
the VGA state and properly program the graphics 
pipeline and display controller. 


VGA functionality with the MediaGX processor 
includes the standard VGA modes (VGA, EGA, 
CGA, and MDA) as well as the higher-resolution 
VESA modes. The CGA and MDA modes (modes 
0 through 7) require that Virtual VGA convert the 
data in the VGA buffer to a separate 8-BPP frame 
buffer that the hardware can use for display 
refresh. 


The remaining modes, VGA, EGA, and VESA, can 
be displayed directly by the hardware, with no data 
conversion required. For these modes, Virtual VGA 
outperforms typical VGA cards because the frame 
buffer data does not travel across an external bus. 


Display drivers for popular GUI (graphical user 
interface) based operating systems are provided 
by Cyrix which enable a full featured 2D hardware 
accelerator to be used instead of the emulated 
VGA core. 


5.1.1 Traditional VGA Hardware 


A VGA card consists of display memory and 
control registers. The VGA display memory shows 
up in system memory between addresses AOQO00Oh 
and BFFFFh. It is possible to map this memory to 
three different ranges within this 128KB block. 


The first range is 

- A0000h to BOOOOh for EGA and VGA modes, 
the second range is 

- BOO000h to B7FFFh for MDA modes, 
and the third range is 

- B8000h to BFFFFh for CGA modes. 


The VGA control registers are mapped to the I/O 
address range from 3BOh to 3DFh. The VGA regis- 
ters are accessed with an indexing scheme that 
provides more registers than would normally fit into 
this range. Some registers are mapped at two loca- 
tions, one for monochrome, and another for color. 


The VGA hardware can be accessed by calling 
BIOS routines or by directly writing to VGA memory 
and control registers. DOS always calls BIOS to 
set up the display mode and render characters. 
Many other applications access the VGA memory 
and control registers directly. The VGA card can be 
set up to a virtually unlimited number of modes. 
However, many applications use one of the 
predefined modes specified by the BIOS routine 
which sets up the display mode. The predefined 
modes are translated into specific VGA control 
register setups by the BIOS. The standard modes 
supported by VGA cards are shown in Table 5-1. 
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Table 5-1 Standard VGA Modes 
Text or 

Category Mode Graphics Resolution Format Type 
Software 0,1 Text 40x25 Characters CGA 
2,3 Text 80x25 Characters CGA 

4,5 Graphics 320x200 2 BPP CGA 

6 Graphics 640x200 1 BPP CGA 

7 Text 80x25 Characters MDA 

Hardware ODh Graphics 320x200 4 BPP EGA 
OEh Graphics 640x200 4 BPP EGA 

OFh Graphics 640x350 1 BPP EGA 

10h Graphics 640x350 4 BPP EGA 

11h Graphics 640x480 1 BPP VGA 

12h Graphics 640x480 4 BPP VGA 

13h Graphics 320x200 8 BPP VGA 


A VGA is made up of several functional units. 


¢ The frame buffer is 256KB of memory that 
provides data for the video display. It is orga- 
nized as 64K 32-bit DWORDs. 


« The sequencer decomposes word and DWORD 
CPU accesses into byte operations for the 
graphics controller. It also controls a number of 
miscellaneous functions, including reset and 
some clocking controls. 


¢ The graphics controller provides most of the 
interface between CPU data and the frame 
buffer. It allows the programmer to read and 
write frame buffer data in different formats. Plus 
provides ROP (raster operation) and masking 
functions. 


¢« The CRT controller provides video timing 
signals and address generation for video 
refresh. It also provides a text cursor. 


¢ The attribute controller contains the video 
refresh datapath, including text rasterization and 
palette lookup. 


« The general registers provide status informa- 
tion for the programmer as well as control over 
VGA-host address mapping and clock selection. 
This is all handled in hardware by the graphics 
pipeline. 


It is important to understand that a VGA is 
constructed of numerous independent functions. 
Most of the register fields correspond to controls 
that were originally built out of discrete logic or 
were part of a dedicated controller such as the 
6845. The notion of a VGA “mode’ is a higher-level 
convention to denote a particular set of values for 
the registers. Many popular programs do not use 
standard modes, preferring instead to produce 
their own VGA setups that are optimal for their 
purposes. 
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5.1.1.1. VGA Memory Organization 


The VGA memory is organized as 64K 32-bit 
DWORDs. This organization is usually presented 
as four 64KB “planes”. A plane consists of one byte 
out of every DWORD. Thus, plane 0 refers to the 
least significant byte from every one of the 64K 
DWORDs. The addressing granularity of this 
memory is a DWORD, not a byte; that is, consecu- 
tive addresses refer to consecutive DWORDs. The 
only provision for byte-granularity addressing is the 
four-byte enable signals used for writes. In C 
parlance, 


single_plane_byte = (dword_fb[address] >> 
(plane * 8)) & OxFF; 


When dealing with VGA, it is important to recog- 
nize the distinction between host addresses, frame 
buffer addresses, and the refresh address pipe. A 
VGA controller contains lots of hardware to trans- 
late between these address spaces in different 
ways, and understanding these translations is crit- 
ical to understanding the entire device. In standard 
four-plane graphics modes, a frame-buffer 
DWORD provides eight 4-bit pixels. The left-most 
pixel comes from bit 7 of each plane, with plane 3 
providing the most significant bit. 


pixel[i].bit[j] = dword_fb[address].bit[j*8 + (7-i)] 


5.1.1.2 VGA Front End 


The VGA front end consists of address and data 
translations between the CPU and the frame 
buffer. This functionality is contained within the 
graphics controller and sequencer components. 
Most of the front end functionality is implemented 
in the VGA read and write hardware of the 
MediaGX processor. An important axiom of the 
VGA is that the front end and back end are 
controlled independently. There are no register 
fields that control the behavior of both pieces. 
Terms like “VGA odd/even mode” are therefore 
somewhat misleading; there are two different 
controls for odd/even functionality in the front end, 
and two separate controls in the refresh path to 
cause “sensible” refresh behavior for frame buffer 


contents written in odd/even mode. Normally, all 
these fields would be set up together, but they 
don’t have to be. This sort of orthogonal behavior 
gives rise to the enormous number of possible 
VGA “modes”. The CPU end of the read and write 
pipes is one byte wide. Word and DWORD 
accesses from the CPU to VGA memory are 
broken down into multiple byte accesses by the 
sequencer. For example, a word write to AO000h 
(in a VGA graphics mode) is processed as if it were 
two-byte write operations to A0000h and AO0O0th. 


5.1.1.3 Address Mapping 


When a VGA card sees an address on the host 
bus, bits [31:15] determine whether the transaction 
is for the VGA. Depending on the mode, addresses 
OOOAXXXX, OOOB{OXXX}XXxX, or 
OOOB{1XXX}XXXX can decode into VGA space. If 
the access is for the VGA, bits [15:0] provide the 
DWORD address into the frame buffer (however, 
see odd/even and Chain 4 modes, below). Thus, 
each byte address on the host bus addresses a 
DWORD in VGA memory. 


On a write transaction, the byte enables are 
normally driven from the sequencer’s MapMask 
register. The VGA has two other write address 
mappings that modify this behavior. In odd/even 
(Chain 2) write mode, bit 0 of the address is used 
to enable bytes 0 and 2 (if zero) or bytes 1 and 3 (if 
one). In addition, the address presented to the 
frame buffer has bit 0 replaced with the PageBit 
field of the Miscellaneous Output register. Chain 4 
write mode is similar; only one of the four byte 
enables is asserted, based on bits [1:0] of the 
address, and bits [1:0] of the frame buffer address 
are set to zero. In each of these modes, the 
MapMask enables are logically ANDed into the 
enables that result from the address. 
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5.2 MediaGX™ Virtual VGA 


The MediaGX processor provides VGA compati- 
bility through a mixture of hardware and software. 
The processor core contains SMI generation hard- 
ware for VGA memory write operations. The bus 
controller contains SMI generation hardware for 
VGA I/O read and write operations. The graphics 
pipeline contains hardware to detect and process 
reads and writes to VGA memory. VGA memory is 
partitioned from system memory. 


5.2.1 Datapath Elements 


The graphics controller contains several elements 
that convert between host data and frame buffer 
data. 


The rotator simply rotates the byte written from the 
host by 0 to 7 bits to the right, based on the Rotate- 
Count field of the DataRotate register. It has no 
effect in the read path. 


The display latch is a 32-bit register that is loaded 
on every read access to the frame buffer. All 32 bits 
of the frame buffer DWORDs are loaded into the 
latch. 


The write-mode unit converts a byte from the host 
into a 32-bit value. A VGA has four write modes: 


¢ Write Mode 0: 

- Bitn of byte b comes from one of two places, 
depending on bit b of the EnableSetReset 
register. If that bit is zero, it comes from bit n 
of the host data. If that bit is one, it comes 
from bit b of the SetReset register. This mode 
allows the programmer to set some planes 
from the host data and the others from 
SetReset. 


¢ Write Mode 1: 
- All 32 bits come directly out of the display 
latch; the host data is ignored. This mode is 
used for screen-to-screen copies. 


« Write Mode 2: 

- Bit n of byte b comes from bit b of the host 
data; that is, the four LSBs of the host data 
are each replicated through a byte of the 
result. In conjunction with the BitMask 
register, this mode allows the programmer to 
directly write a 4-bit color to one or more 
pixels. 


¢ Write Mode 3: 

- Bitn of byte b comes from bit b of the 
SetReset register. The host data is ANDed 
with the BitMask register to provide the bit 
mask for the write (see below). 


The read mode unit converts a 32-bit value from 
the frame buffer into a byte. A VGA has two read 
modes: 


*« Read Mode 0: 

- One of the four bytes from the frame buffer is 
returned, based on the value of the Read- 
MapSelect register. In Chain 4 mode, bits 
[1:0] of the read address select a plane. In 
odd/even read mode, bit 0 of the read 
address replaces bit 0 of ReadMapSelect. 


« Read Mode 1: 

- Bitn ofthe result is set to 1 if bit nin every 
byte b matches bit b of the ColorCompare 
register; otherwise it is set to 0. There is a 
ColorDon’tCare register that can exclude 
planes from this comparison. In four-plane 
graphics modes, this provides a conversion 
from 4 BPP to 1 BPP. 


The ALU is a simple two-operand ROP unit that 
operates on writes. Its operating modes are COPY, 
AND, OR, and XOR. The 32-bit inputs are: 


1) the output of the write-mode unit and 


2) the display latch (not necessarily the value at 
the frame buffer address of the write). 
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An application that wishes to performs ROPs on 
the source and destination must first byte read the 
address (to load the latch) and then immediately 
write a byte to the same address. The ALU has no 
effect in Write Mode 1. 


The bit mask unit does not provide a true bit mask. 
Instead, it selects between the ALU output and the 
display latch. The mask is an 8-bit value, and bit n 
of the mask makes the selection for bit n of all four 
bytes of the result (a zero selects the latch). No bit 
masking occurs in Write Mode 1. 


The VGA hardware of the MediaGX processor 
does not implement Write Mode 1 directly, but it 
can be indirectly implemented by setting the 
BitMask to zero and the ALU mode to COPY. 


5.2.2 Video Refresh 


VGA refresh is controlled by two units: the CRT 
controller (CRTC) and the attribute controller 
(ATTR). The CRTC provides refresh addresses 
and video control; the ATTR provides the refresh 
datapath, including pixel formatting and internal 
palette lookup. 


The VGA back end contains two basic clocks: the 
dot clock (or pixel clock) and the character clock. 
The ClockSelect field of the Miscellaneous Output 
register selects a “master clock” of either 25MHz or 
28MHz. This master clock, optionally divided by 
two, drives the dot clock. The character clock is 
simply the dot clock divided by eight or nine. 


The VGA supports four basic pixel formats. Using 
text format, the VGA interprets frame buffer values 
as ASCII characters, foreground/background 
attributes, and font data. The other three formats 
are all “graphics modes”, known as APA (All Points 
Addressable) modes. These formats could be 
called CGA-compatible (odd/even four bits/pixel), 
EGA-compatible (4-plane four bits/pixel), and VGA- 
compatible (pixel-per-byte eight bits/pixel). The 
format is chosen by the ShiftRegister field of the 
Graphics Controller Mode register. 


The refresh address pipe is an integral part of the 
CRTC, and has many configuration options. 
Refresh can begin at any frame buffer address. 
The display width and the frame buffer pitch (scan- 
line delta) are set separately. Multiple scan lines 
can be refreshed from the same frame buffer 
addresses. The LineCompare register causes the 
refresh address to be reset to zero at a particular 
scan line, providing support for vertical split- 
screen. 


Within the context of a single scan line, the refresh 
address increments by one on every character 
clock. Before being presented to the frame buffer, 
refresh addresses can be shifted by 0, 1, or 2 bits 
to the left. These options are often mis-named 
Byte, Word, and Doubleword modes. Using this 
shifter, the refresh unit can be programmed to skip 
one out of two or three out of four DWORDs of 
refresh data. As an example of the utility of this 
function, consider Chain 4 mode, described earlier. 
Pixels written in Chain 4 mode occupy one out of 
every four DWORDs in the frame buffer. If the 
refresh path is put into “Doubleword” mode, the 
refresh will come only from those DWORDs writ- 
able in Chain 4. This is how VGA mode 13h works. 


In text mode, the ATTR has a lot of work to do. At 
each character clock, it pulls a DWORD of data out 
of the frame buffer. In that DWORD, plane 0 
contains the ASCII character code, and plane 1 
contains an attribute byte. The ATTR uses plane 0 
to generate a font lookup address and read 
another DWORD. In plane 2, this DWORD 
contains a bit-per-pixel representation of one scan 
line in the appropriate character glyph. The ATTR 
transforms these bits into eight pixels, obtaining 
foreground and background colors from the 
attribute byte. The CRTC must refresh from the 
same memory addresses for all scan lines that 
make up a character row; within that row, the ATTR 
must fetch successive scan lines from the glyph 
table so as to draw proper characters. Graphics 
modes are somewhat simpler. In CGA-compatible 
mode, a DWORD provides eight pixels. The first 
four pixels come from planes 0 and 2; each 4-bit 
pixel gets bits [3:2] from plane 2, and bits [1:0] from 
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plane 0. The remaining four pixels come from 
planes 1 and 3. The EGA-compatible mode also 
gets eight pixels from a DWORD, but each pixel 
gets one bit from each plane, with plane 3 
providing bit 3. Finally, VGA-compatible mode gets 
four pixels from each DWORD; plane 0 provides 
the first pixel, plane 1 the next, and so on. The 8 
BPP mode uses an option to provide every pixel for 
two dot clocks, thus allowing the refresh pipe to 
keep up (it only increments on character clocks) 
and meaning that the 320-pixel-wide mode 13h 
really has 640 visible pixels per line. The VGA 
color model is unusual. The ATTR contains a 16- 
entry color palette with 6 bits per entry. Except for 8 
BPP modes, all VGA configurations drive four bits 
of pixel data into the palette, which produces a 6-bit 
result. Based on various control registers, this 
value is then combined with other register contents 
to produce an 8-bit index into the DAC. There is a 
ColorPlaneEnable register to mask bits out of the 
pixel data before it goes to the palette; this is used 
to emulate four-color CGA modes by ignoring the 
top two bits of each pixel. In 8 BPP modes, the 
palette is bypassed and the pixel data goes directly 
to the DAC 


5.2.3 MediaGX VGA Hardware 


The MediaGX processor core contains hardware to 
detect VGA accesses and generate SMI interrupts. 
The graphics pipeline contains hardware to detect 
and process reads and writes to VGA memory. The 
VGA memory on the MediaGX processor is parti- 
tioned from system memory. The MediaGX 
processor has the following hardware components 
to assist the VGA emulation software. 


¢ SMI Generation 

« VGA Range Detection 

« VGA Sequencer 

¢ VGA Write/Read Path 

¢« VGA Address Generator 
* VGA Memory 


5.2.3.1. SMI Generation 


VGA emulation software is notified of VGA memory 
accesses by an SMI generated in dedicated 
circuitry in the processor core that detects and 
traps memory accesses. The SMI generation hard- 
ware for VGA memory addresses is in the second 
stage of instruction decoding on the processor 
core. This is the earliest stage of instruction 
decode where virtual addresses have been trans- 
lated to physical addresses. Trapping after the 
execution stage is impractical, because memory 
write buffering will allow subsequent instructions to 
execute. 


The VGA emulation code requires the SMI to be 
generated immediately when a VGA access 
occurs. The SMI generation hardware can option- 
ally exclude areas of VGA memory, based on a 32- 
bit register which has a control bit for each 2KB 
region of the VGA memory window. The control bit 
determines whether or not an SMI interrupt is 
generated for the corresponding region. The 
purpose of this hardware is to allow the VGA 
emulation software to disable SMI interrupts in 
VGA memory regions that are not currently 
displayed. 


For direct display modes (8 BPP or 16 BPP) in the 
display controller, Virtual VGA can operate without 
SMI generation. 


The SMI generation circuit on the MediaGX 
processor has configuration registers to control 
and mask SMI interrupts in the VGA memory 
space. 
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5.2.3.2 VGA Memory Addresses 


SMI generation can be configured to trap VGA 
memory accesses in one of the following ranges: 


A0000h to AFFFFh (EGA,VGA), 
BO000h to B7FFFh (MDA), 
or B8000h to BEFFFh (CGA). 


Range selection is accomplished through program- 
mable bits in the VGACTL register (Index B9h). 
Fine control can be exercised within the range 
selected to allow off-screen accesses to occur 
without generating SMls. 


SMI generation can also separately control the 
following I/O ranges: 3BOh to 3BFh, 3COh to 3CFh, 
and 3D0h to 3DFh. The BC_XMAP_1 register 
(GX_BASE+8004h) in the Internal Bus Interface 
Unit has an enable/disable bit for each of the 
address ranges above. 


5.2.3.3. VGA Configuration Registers 
Table 5-2 summarizes the VGA Configuration 


Registers. Detailed register/bit formats are given in 
Table 5-3. 


Table 5-2. VGA Configuration Registers Summary 


5.2.3.4 VGA Control Register 


The VGA control register (VGACTL) provides 
control for SMI generation through an enable bit for 
memory address ranges A0000h to BFFFFh. Each 
bit controls whether or not SMI is generated for 
accesses to the corresponding address range. The 
default value of this register is zero so that VGA 
accesses will not be trapped on systems with an 
external VGA card. 


5.2.3.5 VGA Mask Registers 


The VGA Mask register (VGAM) has 32 bits that 
can selectively mask 2KB regions within the VGA 
memory region A0000h to AFFFFh. If none of the 
three regions is enabled in VGACTL, then the 
contents of VGAM are ignored. VGAM can be used 
to prevent the occurrence of SMI when non- 
displayed VGA memory is accessed. This is an 
enhancement that improves performance for 
double-buffered applications only. 


Index Name Description Default 
B9h VGACTL VGA Control Register 00h (SMI generation disabled) 
BAh-BDh VGAM VGA Mask Register Don’t Care 
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Table 5-3 VGA Configuration Registers 


Bit Description 


Index B9h VGACTL Register (R/W) Default Value = 00h 
7:3 Reserved: Set to 0. 
2 SMI generation for VGA memory range B8000h to BFFFFh: 0 = Disable; 1 = Enable 
1 SMI generation for VGA memory range BO000h to B7FFFh: 0 = Disable; 1 = Enable. 
0 SMI generation for VGA memory range A0000h to AFFFFh: 0 = Disable; 1 = Enable 
Index BAh-BDh VGAM Register (R/W) Default Value = xxxxxxxxh 


31 SMI generation for address range AF800h to AFFFFh: 0 = Disable; 1 = Enable. 
30 SMI generation for address range AFOOOh to AF7FFh: 0 = Disable; 1 = Enable. 
29 SMI generation for address range AE800h to AEFFFh: 0 = Disable; 1 = Enable. 
28 SMI generation for address range AEOO0O0h to AE7FFh: 0 = Disable; 1 = Enable. 
27 SMI generation for address range AD800h to ADFFFh: 0 = Disable; 1 = Enable. 
26 SMI generation for address range ADOOOh to AD7FFh: 0 = Disable; 1 = Enable. 
25 SMI generation for address range AC800h to ACFFFh: 0 = Disable; 1 = Enable. 
24 SMI generation for address range ACO0Oh to AC7FFh: 0 = Disable; 1 = Enable. 
23 SMI generation for address range AB800h to ABFFFh: 0 = Disable; 1 = Enable. 
22 SMI generation for address range ABOOOh to AB7FFh: 0 = Disable; 1 = Enable. 
21 SMI generation for address range AA800h to AAFFFh: 0 = Disable; 1 = Enable. 
SMI generation for address range AAOOOh to AA7FFh: 0 = Disable; 1 = Enable. 
SMI generation for address range A9800h to A9FFFh: 0 = Disable; 1 = Enable. 
SMI generation for address range A9000h to A97FFh: 0 = Disable; 1 = Enable. 
SMI generation for address range A8800h to A8FFFh: 0 = Disable; 1 = Enable. 
SMI generation for address range A8000h to A87FFh: 0 = Disable; 1 = Enable. 
SMI generation for address range A7800h to A7FFFh: 0 = Disable; 1 = Enable. 
SMI generation for address range A7000h to A77FFh: 0 = Disable; 1 = Enable. 
SMI generation for address range A6800h to A6FFFh: 0 = Disable; 1 = Enable. 
SMI generation for address range A6000h to A67FFh: 0 = Disable; 1 = Enable. 
SMI generation for address range A5800h to A5FFFh: 0 = Disable; 1 = Enable. 
SMI generation for address range A5000h to A57FFh: 0 = Disable; 1 = Enable. 
SMI generation for address range A4800h to A4FFFh: 0 = Disable; 1 = Enable. 
SMI generation for address range A4000h to A47FFh: 0 = Disable; 1 = Enable. 
SMI generation for address range A3800h to A3FFFh: 0 = Disable; 1 = Enable. 
SMI generation for address range A3000h to A37FFh: 0 = Disable; 1 = Enable. 
SMI generation for address range A2800h to A2FFFh: 0 = Disable; 1 = Enable. 
SMI generation for address range A2000h to A27FFh: 0 = Disable; 1 = Enable. 
SMI generation for address range A1800h to A1FFFh: 0 = Disable; 1 = Enable. 
SMI generation for address range A1000h to A17FFh: 0 = Disable; 1 = Enable. 
SMI generation for address range A0800h to AOFFFh: 0 = Disable; 1 = Enable. 
SMI generation for address range A0000h to A07FFh: 0 = Disable; 1 = Enable. 
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5.2.3.6 VGA Range Detection 


The VGA range detection circuit is similar to the 
SMI generation hardware, however, it resides in 
the bus controller address mapping unit. The 
purpose of this hardware is to notify the graphics 
pipeline when accesses to the VGA memory range 
A0000h to BFFFFh are detected. The graphics 
pipeline has VGA read and write path hardware to 
process VGA memory accesses. The VGA range 
detection can be configured to trap VGA memory 
accesses in one or more of the following ranges: 
A0000h to AFFFFh (EGA,VGA), BO000h to 
B7FFFh (MDA), or B8000h to BFFFFh (CGA). 


5.2.3.7 VGA Sequencer 


The VGA sequencer is located at the front end of 
the graphics pipeline. The purpose of the VGA 
sequencer is to divide up multiple-byte read and 
write operations into a sequence of single-byte 
read and write operations. 16-bit or 32-bit X-bus 
write operations to VGA memory are divided into 8- 
bit write operations and sent to the VGA write path. 
16-bit or 32-bit X-bus read operations from VGA 
memory are accumulated from 8-bit read opera- 
tions over the VGA read path. The sequencer 
generates the lower two bits of the address. 


5.2.3.8 VGA Write/Read Path 


The VGA write path implements standard VGA 
write operations into VGA memory. No SMI is 
generated for write path operations when the VGA 
access is not displayed. When the VGA access is 
displayed, an SMI is generated so that the SMI 
emulation can update the frame buffer. The VGA 
write path converts 8-bit write operations from the 
sequencer into 32-bit VGA memory write opera- 
tions. The operations performed by the VGA write 
path include data rotation, raster operation (ALU), 
bit masking, plane select, plane enable, and write 
modes. 


The VGA read path implements standard VGA 
read operations from VGA memory. No SMI is 
needed for read-path operations. The VGA read 
path converts 32-bit read operations from VGA 
memory to 8-bit data back to the sequencer. The 
basic operations performed by the VGA read path 
include color compare, plane-read select, and read 
modes. 


5.2.3.9 VGA Address Generator 


The VGA address generator translates VGA 
memory addresses up to address where the VGA 
memory resides on the MediaGX processor. The 
VGA address generator requires the address from 
the VGA access (A0000h to BFFFFh), the base of 
the VGA memory on the MediaGX processor, and 
various control bits. The control bits are necessary 
because addressing is complicated by odd/even 
and Chain 4 addressing modes. 


5.2.3.10 VGA Memory 


The VGA memory requires 256KB of memory 
organized as 64KB by 32 bits. The VGA memory is 
implemented as part of system memory. The 
MediaGx processor partitions system memory into 
two areas, normal system memory and graphics 
memory. System memory is mapped to the normal 
physical address of the DRAM, starting at zero and 
ending at memory size. Graphics memory is 
mapped into high physical memory, contiguous to 
the registers and dedicated cache of the MediaGX 
processor. The graphics memory includes the 
frame buffer, compression buffer, cursor memory, 
and VGA memory. The VGA memory is mapped on 
a 256KB boundary to simplify the address genera- 
tion. 
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5.2.4 VGA Video BIOS 


The video BIOS supports the VESA BIOS Exten- 
sions (VBE) Version 1.2 and 2.0, as well as all 
standard VGA BIOS calls. It interacts with Virtual 
VGA through the use of several extended VGA 
registers. These are virtual registers contained in 
the VSA code for Virtual VGA. (These registers are 
defined in a separate document.) 


Table 5-4 Virtual VGA Register Summary 


5.2.5 Virtual VGA Register 


Descriptions 
This section describes the registers contained in 
the graphics pipeline used for VGA emulation. The 
graphics pipeline maps 200h locations starting at 
GX_BASE+8100h. Refer to Section 4.1.2 “Control 
Registers” on page 106 for instructions on 
accessing these registers. 


The registers are summarized in Table 5-4, 
followed by detailed bit formats in Table 5-5. 


GX_BASE+ 
Memory Offset Type Function Default Value 
8210h-8213h R/W GP_VGA_BASE VGA XXXXXXXXN 
Graphics Pipeline VGA Memory Base Address Register — Specifies the offset 
of the VGA memory, starting from the base of graphics memory. 
8214h-8217h R/W GP_VGA_LATCH XXXXXXXXN 
Graphics Pipeline VGA Display Latch Register — Provides a memory mapped 
way to read or write the VGA display latch. 
8140h-8143h R/W GP_VGA_WRITE XXXXXXXXN 
Graphics Pipeline VGA Write Patch Control Register — Controls the VGA 
memory write path in the graphics pipeline. 
8144h-8147h R/W GP_VGA_READ 00000000h 
Graphics Pipeline VGA Read Patch Control Register — Controls the VGA 
memory read path in the graphics pipeline. 
ee 
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PROCESSOR 


Table 5-5 Virtual VGA Registers 


Bit 


Name 


Description 


GX_BASE+8210h-8213h 


GP_VGA_BASE (R/W) Default Value = xxxxxxxxh 


31:14 RSVD Reserved: Set to 0. 
13:8 VGA_BASE Base Address (Read Only): The VGA base address is added to the graphics memory base to 
(RO) specify where VGA memory starts. The VGA base address provides longword address bits 19:14 
when mapping VGA accesses into graphics memory. This allows the VGA base address to start 
on any 64KB boundary within the 4MB of graphics memory. 
76 RSVD Reserved: Set to 0. 
5:0 VGA_BASE Base Address (Write Only): The VGA base address is added to the graphics memory base to 
(WO) specify where VGA memory starts. The VGA base address provides longword address bits 19:14 


when mapping VGA accesses into graphics memory. This allows the VGA base address to start 
on any 64KB boundary within the 4MB of graphics memory. 


GX_BASE+8214h-8217h 


GP_VGA_LATCH Register (R/W) Default Value = xxxxxxxxh 


31:0 


LATCH 


Display Latch: Specifies the value in the VGA display latch. VGA read operations cause VGA 
frame-buffer data to be latched in the display latch. VGA write operations can use the display latch 
as a source of data for VGA frame-buffer write operations. 


GX_BASE+8140h-8143h 


GP_VGA_WRITE Register (R/W) Default Value = xxxxxxxxh 


31:28 RSVD Reserved: Set to 0. 
27:24 MAP_MASK Map Mask: Enables planes 3 through 0 for writing. Combined with chain control to determine the 
final enables. 
23:21 RSVD Reserved: Set to 0. 
20 W3 Write Mode 3: Selects write mode 3 by using the bit mask with the rotated data. 
19 Ww2 Write Mode 2: Selects write mode 2 by controlling set/reset. 
18:16 RC Rotate Count: Controls the eight bit rotator. 
15:12 SRE Set/Reset Enable: Enables the set/reset value for each plane. 
11:8 SR Set/Reset: Selects 1 or 0 for each plane if enabled. 
7:0 BIT_MASK Bit Mask: Selects data from the data latches (last read data). 


GX_BASE+8144h-8147h 


GP_VGA_READ Register (R/W) Default Value = 00000000h 


31:18 RSVD Reserved: Set to 0. 
17:16 RMS Read Map Select: Selects which plane to read in read mode 0 (Chain 2 and Chain 4 inactive). 
15 F15 Force Address Bit 15: Forces address bit 15 to 0. 
14 PC4 Packed Chain 4:— Provides 64KB of packed pixel addressing when used with Chain 4 mode. 
This bit causes the VGA addresses to be shifted right by 2 bits. 
13 C4 Chain 4 Mode: Selects Chain 4 mode for both read operations and write operations. 
12 PB Page Bit: Becomes LSB of address if COE is set high. 
11 COE Chain Odd/Even: Selects PB rather than AO for least-significant VGA address bit. 
10 we Write Chain 2 Mode: Selects Chain 2 mode for write operations. 
9 R2 Read Chain 2 Mode: Selects Chain 2 mode for read operations. 
8 RM Read Mode: Selects between read mode 0 (normal) and read mode 1 (color compare). 
7:4 CCM Color Compare Mask: Selects planes to include in the color comparison (read mode 1). 
3:0 CC Color Compare: Specifies value of each plane for color comparison (read mode 1). 
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6 Power Management 


The power management resources provided by a 
combined MediaGX processor and 
Cx5520/Cx5530-based system have been 
designed to support a full-featured notebook imple- 
mentation. The extent to which these resources 
are employed depends on the application and the 
discretion of the system designer. 


The three greatest power consumers in a notebook 
system are the display, the hard drive and the 
CPU. Managing power for the first two is relatively 
straightforward and is discussed in the I/O 
Companion (Cx5520/Cx5530) specification(s). 
Managing CPU power can be more difficult since 
detecting inactive (Idle) states by monitoring 
external activity is imperfect as well as inefficient. 


The MediaGX processor and Cx5520/Cx5530 I/O 
Companion chip contain the most advanced power 
management features for reducing the power 
consumption of the processor in the system while 
delivering the highest performance in any mobile 
processor. The MediaGX processor supports the 
following CPU power management features: 


« APM Support 

* CPU Suspend Command Registers 
(Cx5520/Cx5530) 

* Suspend Modulation 

* 3 Volt Suspend 

« MediaGX Integrated Processor Serial Bus 


Integrated x86 Solution with MMX™ Support 


(| 
ay 


6.1 APM Support 


Many notebook computers rely solely on the APM 
eagvariced Power Management) driver for DOS™, 
Windows® 3.1 and Windows 95 operating systems 
to manage power to the CPU. APM provides 
several services that enhance the system power 
management by determining when the CPU is idle. 
For the CPU, APM is theoretically the best 
approach but there are some drawbacks. 


1. APMis an OS-specific driver which may not be 
available for some operating systems. 


2. Application support is inconsistent. Some 
applications in foreground may prevent idle 
calls. 


The components for APM support are: 


* Software CPU Suspend control via the 
Cx5520/Cx5530 CPU Suspend Command 
Register (ACh). 


* Software SMI entry via the Software SMI 
Register (DOh). This allows the APM BIOS to be 
part of the SMI handler. 
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6.2 CPU Suspend Command 
Registers 


Power management system software can invoke 
the SUSP#/SUSPA+# protocol with the “CPU 
Suspend Command” and the “Suspend Notebook 
Command?’ registers in the Cx5520/Cx5530. If the 
SUSP#/SUSPA# protocol is invoked, all pending 
SMls are serviced and SMI# is deasserted. Then 
SUSP# is asserted by the Cx5520/Cx5530 and, 
subsequently, SUSPA# is returned by the 
MediaGX processor. When a condition that ends 
the “Suspend” state exists, SMl# is re-asserted. At 
this point, if the PLL in the MediaGX processor has 
not been stopped, then SUSP# is deasserted. 
SUSP# is never deasserted until SUSPA# has 
been sampled active (low). 


Note: The SMI# pin is a unidirectional line from 
the Cx5520/Cx5530 to the MediaGX pro- 
cessor. It is active low. When SM is initi- 
ated from a normal mode, the SMI# pin is 
asserted low and is held low until the SMI 
source is cleared. At that time, SMI# is de- 
asserted. 


6.3 Suspend Modulation 


The hardware provided to support the MediaGX 
processor’s power management works by 
assuming that the MediaGX processor is Idle and 
reducing power until activity is detected. Most 
power management schemes in the industry run 
the system at full speed until a period of inactivity is 
detected. Cyrix’s more aggressive approach yields 
lower power consumption. When activity is 
detected, the MediaGX processor is instantly 
converted to full speed for a programmed duration. 
This is called Suspend Modulation. 


Suspend Modulation acts as backup for cases 
where APM doesn’t correctly detect an Idle condi- 
tion in the system. As long as it is enabled, it will 
only become active in the background. The 
“Suspend Modulation Enable Register” in the 
Cx5520/Cx5530 enables the Suspend Modulation 
feature. 


The “Suspend Modulation ON Count Register” 
(Cx5520/Cx5530) is an 8-bit counter that repre- 
sents the number of 32 Us intervals that the SUSP# 
pin will be asserted to the MediaGX processor. 
This counter, together with the “Suspend Modula- 
tion OFF Count Register” and the IRQ/Video 
Speedup Registers, performs the Suspend Modu- 
lation function for MediaGX processor’s power 
management. The ratio of the on count to the off 
count sets up an effective (emulated) clock 
frequency, allowing the power manager in the 
system to reduce the MediaGX processor’s power 
consumption. 
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6.4 3-Volt Suspend Mode 


The MediaGX processor and Cx5520/Cx5530 
support stopping the processor and system clocks 
using the 3-Volt Suspend Mode. If configured (refer 
Cx5520 or Cx5530 specification), the 
Cx5520/Cx5530 asserts the SUSP_8V pin after the 
SUSP#/SUSPA# handshake. SUSP_ 8V is 
intended to be connected to the output enable of a 
clock synthesizer or buffer chip so that the clocks 
to the MediaGX processor (SYSCLK), the 
Cx5520/Cx5530 (PCI_CLK), and other system 
devices are stopped. The SUSP_3V pin is 
asserted on any write to the Cx5520/Cx5530’s 
“CPU Suspend Command Register” or “Suspend 
Notebook Command Register” with bit 0 of the 
“Clock Stop Control Register” set. 


The MediaGX processor has two low-power 
Suspend modes. The mode implemented is deter- 
mined by bit 0 in the PM Clock Stop Control 
Register. One mode (bit 0 clear) turns off the 
internal clocks to everything except the internal 
display and memory controllers, thereby keeping 
the display active. The second mode, which is 
lower power, turns off all internal clocks generated 
from SYSCLK. This mode is selected by setting bit 
0 in the PM Clock Stop Control Register. If you are 
using DRAMs without self refresh, you must supply 
a 32 kHz clock to the CLK32KHZ bit to keep the 
refresh circuitry active when using the lower-power 
Suspend mode. 


While also in 3-Volt Suspend Mode, the 
Cx5520/Cx5530 continues to decrement all of its 
device timers, and it responds to external SMI 
interrupts using the 32 kHz clock input 
(CLK32KHz) pin. Any SMI event, timer or pin, 
causes the Cx5520/Cx5530 to deassert the 
SUSP_3V pin, starting the system clocks. The 
Cx5520/Cx5530 holds SUSP# active for a pre- 
programmed period that varies from 0 to 16 ms, 
which allows the clocks to settle. After this period 
expires, the Cx5520/Cx5530 deasserts SUSP#. 
SMI# is held active for the entire period, so that the 
MediaGxX processor status registers are updated. 


The SUSP_3V pin can be active either high or low. 
The pin is an input during POR, and is sampled to 
determine its inactive state. This allows a designer 
to match the active state of SUSP_8V to the inac- 
tive state for a clock driver output enable with a 
pull-up or pull-down resistor. 
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6.5 Suspend Mode and Bus Cycles 


The following subsections describe the bus cycles 
when the Suspend mode is implemented. 


6.5.1 Initiating Suspend with SUSP# 


The MediaGX processor has two low-power 
Suspend modes. The mode is selected by bit 0 in 
the PM Clock Stop Control Register. One mode (bit 
0 cleared) turns off the internal clocks to everything 
but the internal Display and Memory Controllers, 
keeping the display active. A lower-power mode 
turns off all internal clocks generated from 
SYSCLK. This mode is selected by setting bit 0 in 
the PM Clock Stop Control Register. If the bit is set 
and DRAMS without self-refresh are used, a 32 
KHz clock must be supplied to the CLK32KHZ 
input to keep the refresh circuit active. 


The MediaGX processor enters the Suspend mode 
in response to SUSP# input assertion only when 
certain conditions are met. First, the USE_SUSP 
bit must be set in CCR2 (Index C2h[7]). In addition, 


execution of the current instructions and any 
pending decoded instructions and associated bus 
cycles must be completed. SUSP# is sampled on 
the rising edge of SYSCLK, and must meet speci- 
fied setup and hold times to be recognized at a 
particular SYSCLK edge. 


When all conditions are met, the SUSPA# output is 
asserted. The time from assertion of SUSP# to the 
activation of SUSPA# depends on which instruc- 
tions were decoded prior to assertion of SUSP#. 
Normally, once SUSP# has been sampled inactive 
the SUSPA# output will be deactivated within two 
clocks. However, the deactivation of SUSPA# may 
be delayed until the end of an active refresh cycle. 


If the CPU is already in a Suspend mode initiated 
by SUSP#, one occurrence of NMI, INTR and SMIl# 
is stored for execution after Suspend mode is 
exited. The CPU also allows PCI accesses during 
a SUSP#-initiated Suspend mode (see Figure 6-1). 
If the CPU is in the middle of a PCI access when 
SUSP# is asserted, the assertion of SUSPA# will 
be delayed until the PCI access is completed. 


SYSCLK 


SUSP# 


SUSPA# 


Figure 6-1 SUSP#-Initiated Suspend Mode 
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6.5.2 Initiating Suspend with HALT 


The CPU also enters Suspend mode as a result of 
executing a HALT instruction if the SUSP_HALT bit 
in CCR2 (Index C2h[3]) is set. Suspend mode is 
then exited upon recognition of an NMI, an 
unmasked INTR, or an SMI#. Normally SUSPA# is 
deactivated within six SYSCLKS from the detection 
of an active interrupt. However, the deactivation of 


SUSPA# may be delayed until the end of an active 
refresh cycle. 


The CPU also allows PCI accesses during a HALT- 
initiated Suspend mode. If the CPU is in the middle 
of a PCI access when the Halt instruction is 
executed, the assertion of SUSPA# will be delayed 
until the PCI access is completed. 


SYSCLK 


FRAME# 


C/BE[3:0}# 


AD[15:0] 


IRDY# 


INTR, NMI, 


SMI# 


SUSPA# 


Figure 6-2 HALT-Initiated Suspend Mode 


GXm_db_v2.0 


Cyrix Corporation Confidential 


Page 205 


Cyrix re 
MedialXx 


M M X"*- E N H AN € E D 


6.5.3 Responding to a PCI Access asserted when the PCI access is completed, the 

During Suspend Mode MediaGX processor will assert SUSPA# and return 
to a SUSP#-initiated Suspend mode. If it was a 
HALT-initiated Suspend mode and no active inter- 
rupts have been recognized, the CPU will assert 
SUSPA# and return to a HALT-initiated Suspend 
mode. 


The MediaGX processor can temporarily exit 
Suspend mode to handle PCI accesses. If an 
unmasked REQx# is asserted, the MediaGX 
processor will deassert SUSPA# and exit the 
Suspend mode to respond to the PCI access. A 
PCI access is completed when FRAME is inactive 
and TRDY# or STOP# are active. If SUSP# is 


SYSCLK 
REQx# 
FRAME# 
TRDY# 
SUSP# 
SUSPA# 


Figure 6-3 PCI Access During Suspend Mode 
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6.5.4 Stopping the Input Clock 


Because the MediaGX processor is a static device, 
the input clock (SYSCLK) can be stopped and 
restarted without any loss of internal CPU data. If 
DRAMS are used that do not have self-refresh, bit 
0 of the PM Clock Stop Control Register must be 
set to a one and the CLK32KHZ input must be 
continuously applied to keep the refresh circuitry 
running. The SYSCLK input can be stopped at 
either a logic high or logic low state. The required 
sequence for stopping SYSCLK is to initiate CPU 
Suspend mode, wait for the assertion of SUSPA# 
by the processor, and then stop the input clock. 


The CPU remains suspended until SYSCLK is 
restarted and the Suspend mode is exited as 
described earlier. While SYSCLK is stopped, the 
processor can no longer sample and respond to 
any input stimulus including REQx#, NMI, SMI#, 
INTR, and RESET inputs. 


Figure 6-4 illustrates the recommended sequence 
for stopping the SYSCLK using SUSP# to initiate 
Suspend mode. SYSCLK may be started prior to or 
following negation of the SUSP# input. The figure 
includes the SUSP_8V pin from the 
Cx5520/Cx5530 which is used to stop the external 
clocks. 


SYSCLK 
SUSP# 
SUSPA# 


SUSP_3V 
(Cx5520/Cx5530) 


SMI Event, Timer or Pin | 


Figure 6-4 Stopping SYSCLK During Suspend Mode 
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6.6 MediaGX Processor Serial Bus 


The power management logic of the MediaGX 
processor provides the Cx5520/Cx5530 with infor- 
mation regarding the MediaGX processor produc- 
tivity. If the MediaGX processor is determined to be 
relatively inactive, the MediaGX processor power 
consumption can be greatly reduced by entering 
the Suspend Modulation mode. 


Although the majority of the system power 
management logic is implemented in the 
Cx5520/Cx5530, a small amount of logic is 
required within the MediaGX processor to provide 
information from the graphics controller that is not 
externally visible otherwise. The MediaGX 
processor implements a simple serial communica- 
tions mechanism to transmit the CPU status to the 
Cx5520/Cx5530. The MediaGX processor accu- 
mulates CPU events in a 8-bit register, “PM Serial 
Packet Register” (GX_BASE+850Ch, see Table 6- 
2), which is serially transmitted out of the MediaGX 
processor every 1 to 10 Us. The transmission 
frequency is set with the “PM Serial Packet Control 
Register” (GX_BASE+8504h, see Table 6-2). 


6.6.1 Serial Packet Transmission 


The MediaGX processor transmits the contents of 
the “PM Serial Packet Register” on the SERIALP 
output pin to the PSERIAL input pin of the 
Cx5520/Cx5530. The MediaGX processor holds 
SERIALP low until the transmission interval 
counter (GX_BASE+8504h[4:3]) has elapsed. 
Once the counter has elapsed, PSERIAL is held 
high for two SYSCLKs to indicated the start of 
packet transmission. The contents of the packet 
register are then shifted out starting from bit 7 
down to bit 0. PSERIAL is held high for one 
SYSCLK to indicate the end of packet transmission 
and then remains low until the next transmission 
interval. After the packet transmission has 
completed, the packet contents are cleared. 
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6.7 Power Management Registers 


The MediaGX processor contains the power Note, however, the PM_BASE and PM_MASK 

management registers for the serial packet trans- registers are accessed with the CPU_READ and 
mission control, the user-defined power manage- CPU_WRITE instructions. Refer to Section 4.1.6 
ment address space, Suspend Refresh, and SMI “CPU_READ/CPU_WRITE Instructions” on page 
status for Suspend/Resume. These registers are 111 for more information regarding these instruc- 


memory mapped (GX_BASE+8500h-8FFFh) in the tions. 
address space of the MediaGX processor and are 
described in the following sections. Refer to 

Section 4.1.2 “Control Registers” on page 106 for 


Table 6-1 summarizes the above mentioned regis- 
ters. Tables 6-2 and 6-3 give these register’s bit 


instructions on accessing these registers. formats. 
Table 6-1 Power Management Register Summary 
GX_BASE+ Default 
Memory Offset Type Name/Function Value 


Control and Status Registers 


8500h-8503h R/W PM_STAT_SMI XXxxxx00h 


PM SMI Status Register — Contains System Management Mode (SMM) status 
information used by SoftVGA. 


8504h-8507h R/W PM_CNTRL_TEN XXxxxx00h 


PM Serial Packet Control Register — Sets the serial packet transmission fre- 
quency and enables specific CPU events to be recorded in the serial packet. 


8508h-850Bh R/W PM_CNTRL_CSTP XXxXxxx00h 


PM Clock Stop Control Register — Enables the 3-V Suspend Mode for the Medi- 
aGX processor. 


850Ch-850Fh R/W PM_SER_PACK XXXXxXx00h 
PM Serial Packet Register — Transmits the contents of the serial packet. 


Default 
Index Type Name/Function Value 


Programmable Address Region Registers 


FFFF FF6Ch R/W | PM_BASE 00000000h 
PM Base Register — Contains the base address for the programmable memory 
range decode. This register, in combination with the PM_MASK register, is used 
to generate a memory range decode which sets bit 1 in the serial transmission 
packet. 

FFFF FF7Ch R/W_ | PM_MASK 00000000h 
PM Mask Register — The address mask for the PM_BASE register 
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Table 6-2 Power Management Control and Status Registers 


Bit Name Description 

GX_BASE+8500h-8503h PM_STAT_SMI Register (R/W) Default Value = xxxxxx00h 
31:8 RSVD Reserved — These bits are not used. Do not write to these bits. 
7:33 RSVD Reserved — Set to 0. 

2 SMI_MEM SMI VGA Emulation Memory — This bit is set high if a SMI was generated for VGA emulation in 
response to a VGA memory access. An SMI can be generated on a memory access to one of 
three regions in the AO000h-to-BFFFFh range as specified in the BC_XMAP_1 register. 

1 SMI_IO SMI VGA Emulation I/O — This bit is set high if a SMI was generated for VGA emulation in 
response to an I/O access. An SMI can be generated on a I/O access to one of three regions in 
the 3B0h-to-3DFh range as specified in the BC_XMAP_1 register. 

0 SMI_PIN SMI Pin — When set high, this bit indicates that the SMI# input pin has been asserted to the 
MediaGX processor. 


Note: These bits are “sticky” bits and can only be cleared with a write of ‘1’ to the respective bit. 


GX_BASE+8504h-8507h 


PM_CNTRL_TEN Register (R/W) Default Value = xxxxxx00h 


31:8 


RSVD 


Reserved — These bits are not used. Do not write to these bits. 


7:6 


RSVD 


Reserved — Set to 0. 


5 


X_TEST (WO) 


Transmission Test (Write Only) — Setting this bit causes the MediaGX Processor to immedi- 
ately transmit the current contents of the serial packet. This bit is write only and is used primarily 
for test. This bit returns 0 on a read. 


4:3 


X_FREQ 


Transmission Frequency — This field indicates the time between serial packet transmissions. 
Serial packet transmissions occur at the selected interval only if at least one of the packet bits is 
set high: 00 = Disable transmitter; 01 = 1 ms; 10 =5 ms; 11 =10 ms. 


CPU_RD 


CPU Activity Read Enable — Setting this bit high enables reporting of CPU Level-1 cache read 
misses that are not a result of an instruction fetch. This bit is a don’t-care if the CMEN bit is not set 
high 


CPU_EN 


CPU Activity Master Enable — Setting this bit high enables reporting of CPU Level-1 cache 
misses in bit 6 of the serial transmission packet. When enabled, the CPU Level-1 cache miss 
activity is reported on any read (assuming the CREN is set high) or write access excluding misses 
that resulted from an instruction fetch. 


VID_EN 


Video Event Enable — Setting this bit high enables video decode events to be reported in bit 0 of 
the serial transmission packet. CPU or graphics-pipeline accesses to the graphics memory and 
display-controller-register accesses are also reported. 


GX_BASE+8508h-850Bh 


PM_CNTRL_CSTP Register (R/W) Default Value = xxxxxx00h 


31:8 RSVD Reserved — These bits are not used. Do not write to these bits. 
7 RSVD Reserved — Set to 0. 
0 CLK_STP Clock Stop — This bit configures the MediaGX processor for Suspend Refresh Mode or 3-Volt 


Suspend Mode: 


0 = Suspend Refresh Mode. The clocks to the memory and display controller are active. 
1 = 3-Volt Suspend Mode. All internal clocks are stopped. 


Note: When this register is set high and the Suspend input pin (SUSP#) is asserted, the MediaGX processor stops all it’s internal 
clocks, and asserts the Suspend Acknowledge output pin (GSUSPA#). Once SUSPA# is asserted the MediaGX processor’s 
SYSCLK input can be stopped. If this register is cleared, the internal memory-controller and display-controller clocks are 
not stopped on the SUSP#/SUSPA# sequence, and the SYSCLK input can not be stopped. 
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Table 6-2 Power Management Conirol and Status Registers (cont.) 


Bit Name Description 
GX_BASE+850Ch-850Fh PM_SER_PACK Register (R/W) Default Value = xxxxxx00h 
31:8 RSVD Reserved — These bits are not used. Do not write to these bits. 
7 VID_IRQ Video IRQ — This bit indicates the occurrence of a video vertical sync pulse. This bit is set at the 


same timer that the VINT (Vertical Interrupt) bit is set in the DC_TIMING_CFG register. The VINT 
bit has a corresponding enable bit (VIEN) in the DC_TIM_CFG register. 


6 CPU_ACT CPU Activity — This bit indicates the occurrence of a level 1 cache miss that was not a result of 
an instruction fetch. This bit has a corresponding enable bit in the PM_CNTL_TEN register. 
5:2 RSVD Reserved — Set to 0. 
1 USR_DEF Programmable Address Decode — This bit indicates the occurrence of a programmable mem- 


ory address decode. This bit is set based on the values of the PM_BASE register and the 
PM_MASK register. The PM_BASE register can be initialized to any address in the full 128MB 
address range. 


0 VID_DEC Video Decode — This bit indicates that the CPU has accessed either the Display Controller regis- 
ters or the graphics memory region. This bit has a corresponding enable bit in the 
PM_CNTRL_TEN. 


Note: The MediaGX processor transmits the contents of the serial packet only when a bit in the packet register is set and the 
interval counter has elapsed. The Cx5520/Cx5530 decodes the serial packet after each transmission. Once a bit in the 
packet is set, it will remain set until the completion of the next packet transmission. Successive events of the same type that 
occur between packet transmissions are ignored. Multiple unique events between packets transmissions will accumulate in 


this register. 
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Table 6-3. Power Management Programmable Address Region Registers 


Bit Name Description 
Index FFFF FF6Ch PM_BASE Register (R/W) Default Value = 0000000h 
31:28 RSVD Reserved — Set to 0. 
27:2 BASE_ADDR Base Address — This is the word-aligned base address for the programmable memory range 
compare. The actual address range is determined with this field and the PM_MASK register value. 
1:0 RSVD Reserved — Set to 0. 
Index FFFF FF7Ch PM_MASK Register (R/W) Default Value = 0000000h 
31:28 RSVD Reserved — Set to 0. 
27:2 ADR_MASK Address Mask — This field is the address mask for the BASE_ADDR field in the PM_BASE reg- 


ister. If a bit in the ADR_MASK field is cleared the corresponding bit in the BASE_ADDR field must 
match the processor address. If a bit in the mask field is set high, the corresponding bit in the 
BASE_ADDR field always compares. If the processor cycle type matches the values of the WE 
and RE bits, and all bits in the BADD field match the processor address based on the ADR_MASK 
field, bit 1 will be set high in the serial transmission packet. 


1 WE Write Enable — Compare memory write cycles with BASE_ADDR and ADR_MASK: 
0 = Disable; 1 = Enable. 
0 RE Read Enable — Compare memory read cycles with BASE_ADDR and ADR_MASK: 


0 = Disable; 1 = Enable 
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7 Electrical Specifications 


This section provides information on electrical 
connections, absolute maximum ratings, recom- 
mended operating conditions, DC characteristics, 
and AC characteristics. All voltage values in the 
Electrical Specifications are with respect to Vsg 
unless otherwise noted. For detailed information on 
the PCI bus electrical specification refer to Chapter 
4 of the PCI Bus Specification, Revision 2.1. 


7.1 Part Numbers 


The following part numbers designate the various 
speeds available. For all speeds, the Voce voltage 
is 2.9V nominal and the Vcc voltage is 3.3V 
nominal. 


Table 7-1 Part Numbers 
CPU Bus Speed (MHz) 

Speed & Multiplier Part Number 

200MHz 33.3MHz x 6 MediaGX-200BP 2.9V 
MediaGX-200GP 2.9V 

233MHz 33.3MHz x 7 MediaGX-233BP 2.9V 
MediaGX-233GP 2.9V 

266MHz 33.3MHz x 8 MediaGX-266BP 2.9V 
MediaGX-266GP 2.9V 

300MHz 33.3MHz x 9 MediaGX-300BP 2.9V 
MediaGX-300GP 2.9V 


Note: BP = BGA Package 
GP = SPGA Package 


7.2 Electrical Connections 


Power/Ground Connections 
and Decoupling 


Testing and operating the MediaGX processor 
requires the use of standard high frequency tech- 
niques to reduce parasitic effects. These effects 
can be minimized by filtering the DC power leads 
with low-inductance decoupling capacitors, using 
low-impedance wiring, and by utilizing all of the 
Veco: Voc: and VSS pins. 


7.2.1 


7.2.2 Power Sequencing 


the Core and I/O Voltages 


With two voltages connected to the MediaGX 
processor, it is important that the voltages come up 
in the correct order. Vcc2 should come up at or 
before Vcc3. There are no additional timing 
requirements related to this sequence. 


7.2.3 NC-Designated Pins 


Pins designated NC (No Connection) should be left 
disconnected. Connecting an NC pin to a pull-up/- 
down resistor, or an active signal could cause 
unexpected results and possible circuit malfunc- 
tions. 
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7.2.4 Pull-Up and Pull-Down 
Resistors 


Table 7-2 lists the input pins that are internally 
connected to a 20-kohm pull-up/-down resistor. 


When unused, these inputs do not require connec- 


tion to an external pull-up/-down resistor. 


Table 7-2 Pins with 20-kohm Internal Resistor 


Electrical Connections 


7.2.5 Unused Input Pins 


All inputs not used by the system designer and not 
listed in Table 7-2 should be kept at either ground 
or Vcc3- To prevent possible spurious operation, 
connect active-high inputs to ground through a 20- 
kohm (+10%) pull-down resistor and active-low 
inputs to Vec3 through a 20-kohm (410%) pull-up 
resistor. 


BGA SPGA 
Signal Name Ball No. | Pin No. PU/PD 
SUSP* H2 M4 Pull-up 
FRAME# A8 C13 Pull-up 
IRDY# cg D14 Pull-up 
TRDY# B9 B14 Pull-up 
STOP# C11 A15 Pull-up 
LOCK# B11 B16 Pull-up 
DEVSEL# AQ E15 Pull-up 
PERR# Al1 D16 Pull-up 
SERR# C12 A17 Pull-up 
REQ[2:0]# D3, E3, Pull-up 
H3, K2, 
E3 E1 
TCLK J2 P4 Pull-up 
TMS H1 N3 Pull-up 
TDI D2 F4 Pull-up 
TEST F3 J5 Pull-down 
Note: *SUSP# is pulled up when not active. 
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7.3 Absolute Maximum Ratings 


Table 7-3 lists absolute maximum ratings for the 
MediaGX processor. Stresses beyond the listed 
ratings may cause permanent damage to the device. 
Exposure to conditions beyond these limits may (1) 
reduce device reliability and (2) result in premature 
failure even when there is no immediately apparent 
sign of failure. Prolonged exposure to conditions at 


Table 7-3. Absolute Maximum Ratings 


or near the absolute maximum ratings may also 
result in reduced useful life and reliability. These 
are stress ratings only and do not imply that opera- 
tion under any conditions other than those listed 
under Table 7-4 is possible. 


Parameter Min Max Units Notes 
Operating Case Temperature -65 110 °C Power Applied 
Storage Temperature —65 150 °C No Bias 
Supply Voltage 3.6 Vv 

Voltage On Any Pin -0.5 6.0 V 

Input Clamp Current, Iix —0.5 10 mA Power Applied 
Output Clamp Current, lox 25 mA Power Applied 
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7.4 Recommended Operating Conditions 
Table 7-4 lists the recommended operating conditions for the MediaGX processor. 


Table 7-4 Recommended Operating Conditions 


Symbol | Parameter Min Max Units Notes 
To Operating Case Temperature 0 70 °C For Desktop Applications 
Tc Operating Case Temperature 0 85 °C For Notebook Applications 
Veco Supply Voltage (2.9V nominal) 2.75 3.05 Vv 
Voec3 Supply Voltage (3.3V nominal) 3.14 3.46 Vv 
Vin High-Level Input Voltage: 
All input and I/O pins except 2.0 5.5 V Note 1 
SDRAM Interface and SYSCLK 
SDRAM Interface 2.0 Vec3+0.5 Vv Note 2 
SYSCLK 2.7 5.5 Vv Note 1 
Vit Low-Level Input Voltage: 
All except PCI bus and SYSCLK -0.5 0.8 Vv 
PCl bus -0.5 0.3*Vocg3 Vv 
SYSCLK -0.5 0.4 Vv 
lou High-Level Output Current -2 mA Vo = Vou (Min) 
lot Low-Level Output Current 5 mA Vo = Vor (Max) 
Notes: 1) This parameter indicates that these pins are tolerant to the PCI 5 Volt Signaling Environment DC 
specification. 
2) SDRAM Interface Pins: BA[1:0], CAS[A:B]#, CKE[A:B], CS[3:0]#, DQM[7:0], MA[12:0], MD[63:0], 
RASA#, RASB#, SDCLK_IN, SDCLK_OUT, SDCLK[3:0], TEST[3:0], WE[A:B]# 


Page 216 Cyrix Corporation Confidential GXm_db_v2.0 


DC Characteristics 7 


7.5 DC Characteristics 


Table 7-5 DC Characteristics (at Recommended Operating Conditions) 
Symbol Parameter Min Typ Max Units Notes 
VoL Output Low Voltage 0.4 V lol =5mA 
Vou Output High Voltage 2.4 Vv lon =-2 mA 
I Input Leakage Current for all input pins except +10 HA 0< Vin < Voc3: 
those with internal PU/PDs See Table 7-2 
lin Input Leakage Current for all pins with 200 LA Vin = 2.4 V, 
internal PDs. See Table 7-2 
lit Input Leakage Current for all pins with —400 LA Vi_ = 0.35 V, 
internal PUs. See Table 7-2 
loc Active loc: 
Core Ioc¢2 at fork = 200MHz 1.45 2.55 A Note 1 
V/O Io¢c¢3 at fork = 200MHz 0.30 0.34 
Core Ioc¢2 at fo_k = 233MHz 1.55 2.85 
V/O Io¢c3 at fork = 233MHz 0.32 0.35 
Core Ioc¢2 at fo__ = 266MHz 1.65 3.10 
V/O Ioc¢3 at fork = 266MHz 0.33 0.36 
Core Icc2 at folk = 300MHz 1.75 3.35 
V/O Ic at fork = 300MHz 0.34 0.37 
locsm Suspend Mode Icc: 
Core Ioc¢2 at fork = 200MHz 285 360 mA Notes 1 and 4 
V/O Io¢c¢3 at fork = 200MHz 240 300 
Core Ioc2 at fo_q = 233MHz 530 600 
V/O Ioc¢3 at fork = 233MHz 250 310 
Core Ioc2 at fo__ = 266MHz 650 750 
V/O Io¢c3 at fork = 266MHz 260 330 
Core Icc¢2 at fork = 300MHz 770 900 
V/O I¢c¢3 at fork = 300MHz 270 350 
locss Standby I¢c¢ (Suspend and CLK Stopped): 
Core Ioc¢2 at fo_q = OMHz 10 60 mA Notes 1 and 3 
fe) loc3 at fotk = OMHz 7 10 
Cin Input Capacitance 16 pF f = 1MHz, Note 2 
Cout Output or I/O Capacitance 16 pF f = 1MHz, Note 2 
Cok CLK Capacitance 12 pF f = 1MHz, Note 2 
Notes: 1. fo,, ratings refer to internal clock frequency. 
2. Not 100% tested. 
3. All inputs are at 0.2 V or Voc3 — 0.2 (CMOS levels). All inputs are held static and all outputs are unloaded 
(static loyt = 0 mA). 
4. All inputs are at 0.2 V or Vocg — 0.2 (CMOS levels). All inputs except clock are held static and all outputs are unloaded 
(static Ioyt = 0 mA). 
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7.6 AC Characteristics All AC tests are at Veco = 2.75V to 3.05V (2.9V 
The following tables list the AC characteristics nominal), Tc = 0°C to 70°C or 85°, C, = 50 pF 
including output delays, input setup requirements, unless otherwise specified. 

input hold requirements and output float delays. 

The rising-clock-edge reference level Ves and Table 7-6 Drive Level and Measurement 
other reference levels are shown in Table 7-6. Points for Switching Characteristics 
Input or output signals must cross these levels during 

testing. Symbol Voltage (V) 
Input setup and hold times are specified minimums VREF i 

that define the smallest acceptable sampling VIHD 2.4 

window for which a synchronous input signal must Vis 0.4 


be stable for correct operation. 


Tx 


V 
CLK as f \ { \ VREF 


OUTPUTS Valid Output , — — Valid Output ,,4 — — ~ Veer 


V w 
INPUTS Bae ie — = Valid Input - 


ViLD 


Legend: A = Maximum Output Delay Specification 
B = Minimum Output Delay Specification 
C = Minimum Input Setup Specification 
D = Minimum Input Hold Specification 


Figure 7-1 Drive Level and Measurement Points for Switching Characteristics 
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Table 7-7 Clock Signals 


200MHz (6x) 233MHz (7x) 266MHz (8x) 300MHz (9x) 
(Note) (Note) (Note) (Note) 
Symbol | Parameter Min Max Min Max Min Max Min Max Units 
1 SYSCLK Period 30.0 30.0 30.0 30.0 ns 
t2 SYSCLK Period Stability +250 +250 +250 +250 ps 
3 SYSCLK High Time 10 10 10 10 ns 
4 SYSCLK Low Time 10 10 10 10 ns 
tS SYSCLK Fall Time 0.15 2.0 0.15 2.0 0.15 2.0 0.15 2.0 ns 
t6 SYSCLK Rise Time 0.15 2.0 0.15 2.0 0.15 2.0 0.15 2.0 ns 
t7 DCLK Period 9.3 9.3 9.3 9.3 ns 
8 DCLK Rise/Fall Time 3.0 3.0 3.0 3.0 ns 
t9 SDCLK_OUT, 13 17 11 16 10 13 9 11 ns 
SDCLK[3:0] Period 
t10 SDCLK_OUT, 6.5 5.5 5 45 ns 
SDCLK[3:0] High Time 
11 SDCLK_OUT, 6.5 5.5 5 4.5 ns 
SDCLK[3:0] Low Time 
t12 SDCLK_OUT, 0.15 2.0 0.15 2.0 0.15 2.0 0.15 2.0 ns 
SDCLK[3:0] Fall Time 
13 SDCLK_OUT, 0.15 2.0 0.15 2.0 0.15 2.0 0.15 2.0 ns 
SDCLK[3:0] Rise Time 
Note: SDCLK timings (t9-t13) assume an SDCLK that is a "divide by 3" from the internal core clock. Hence: 
200MHz (6x) = 66.7MHz SDCLK 
233MHz (7x) = 77.7MHz SDCLK 
266MHz (8x) = 88.7MHz SDCLK 
300MHz (9x) = 100MHz SDCLK 
t1 | 
at a 
I 13 | 
—+__——_ 
I | 
VIH (Min) 
1.5V 
VIL (Max) 


SYSCLK 


Figure 7-2 SYSCLK Timing and Measurement Points 
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3 aad 
| | 
| 
| 
DCLK 
Figure 7-3 DCLK Timing and Measurement Points 
i} t9 | 
~#t > 
F t10 | | 
—<——__—_— 
I | 
VIH (Min) 
1.5V 
VIL (Max) 
SDCLK, | | 
SDCLK[3:0] me, <a — —> <= 


Figure 7-4 SDCLK, SDCLK[3:0] Timing and Measurement Points 


Table 7-8 System Signals 


Parameter Min Max Unit Notes 


Setup Time for RESET, INTR ns Note 
Hold Time for RESET, INTR 


5 
2 
Setup Time for SMI#, SUSP#, FLT# 5 ns 
2 
2 


ns Note 


Hold Time for SMl#, SUSP#, FLT# ns 

Valid Delay for IRQ13, SUSPA# 15 ns 

Valid Delay for SERIALP 2 15 ns 

Note: The system signals may be asynchronous. The setup/hold times are required for determining static 
behavior. 
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Table 7-9 PCI Interface Signals 


Symbol Parameter Min Max Unit Notes 
tvaLt Delay Time, SYSCLK to Signal Valid for Bused 2 11 ns 
Signals 

tyaLe2 Delay Time, SYSCLK to Signal Valid for GNT# 2 12 ns Note 
ton Delay Time, Float to Active 2 ns 
torr Delay Time, Active to Float 28 ns 
tsut Input Setup Time for Bused Signals 7 ns 
tsue Input Setup Time for REQ# 12 ns Note 
ty Input Hold Time to SYSCLK 0 ns 
Note: GNT# and REQ# are point-to-point signals. All other PCI interface signals are bused. 

Refer to Chapter 4 of PCI Local Bus Specification, Revision 2.1, for more detailed information. 


SYSCLK ee ne ee 


| 
WVALI.2 [gee 


OUTPUT 
OUTPUT 


| 
torr | 


Figure 7-5 Output Timing 


SYSCLK A ee 


Figure 7-6 Input Timing 
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Table 7-10 SDRAM Interface Signals 


Symbol | Parameter Min Max Unit 
t1 CNTRL* Output Valid from Equation Number = Equation Number = ns 
SDCLK[3:0] —1.5 (see below) —1.0 (see below) 
t2 MA[12:0], BA[1:0] Output Valid from Equation Number = Equation Number = ns 
SDCLK[3:0] —1.7 (see below) —1.2 (see below) 

3 MD[63:0] Output Valid from Equation Number = Equation Number = ns 
SDCLK[3:0] —1.6 (see below) —0.3 (see below) 
t4 MD[63:0] Read Data in Setup to 0 ns 

SDCLKIN 
ite) MD[63:0] Read Data Hold to SDCLKIN 2.0 ns 
*CNTRL = RASA#, RASB# CASA#, CASB#, WEA#, WEB#, CKEA, CKEB, DQM[7:0], CS[3:0]#. 
Load = 50pF, Core Vcc = 2.9, I/O Vcc = 3.3V, 25°C. 
Output Valid Equation: Use Min or Max number in equation: Min# or Max# + (x * y) 
Where: x = shift value applied to SHFTSDCLK field and y = (core clock period) + 2 
Note that SHFTSDCLK field = GX_BASE+8404h[5:3], see page 123. 
Equation Example: 
A 200MHz MediaGX processor running a 66MHz SDRAM bus, with a shift value of 2: 
t1 Min =-1.5 + (2* (6+2))=3.5ns 
t1 Max = -1.0 + (2* (6 +2)) = 4.0 ns 
~—_ >, [1, 12, 13 
—— a ; ———_ 4 
SDCLK[3:0] | 
CNTRL, MA[12:0], 
Balt), MOIe3 0 | 
Figure 7-7 Output Valid Timing 
SDCLKIN 
MDJ[63:0] 
Figure 7-8 Setup and Hold Timings - Read Data In 
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Table 7-11 Video Interface Signals 


Symbol Parameter Min Max Unit Notes 
t1 PCLK Period 7.4 40 ns 
t2 PCLK High Time 3 ns 
t3 PCLK Low Time 3 ns 
t4 PIXEL[17:0], CRT_HSYNC, CRT_VSYNC, 2 5 ns 


FP_HSYNC, FP_VSYNC, ENA_DISP Valid Delay 
from PCLK Rising Edge 


t5 VID_CLK Period 8.5 ns 
t6 VID_RDY Setup to VID_CLK Rising Edge 5 ns 
t7 VID_RDY Hold to VID_CLK Rising Edge 2 ns 
t8 VID_VAL, VID_DATA|7:0] Valid Delay from 2 5 ns 
VID_CLK Rising Edge 
t9 DCLK Period 7.4 ns 
t10 DCLK Rise/Fall Time 3 ns 
tcyc DCLK Duty Cycle 40 60 % 
ia t1 i t4 
| 2 13 | | 
i >\< >| 
PCLK | | | | | | 
PIXEL[17:0], 
CRT_HSYNC, CRT_VSYNC, 
FP_HSYNC, FP_VSYNC, 
ENA_DISP 


Figure 7-9 Graphics Port Timing 
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| 5 | | | 16 _ | 
——_—$_ 


oH | ] 
VID_CLK | | | | | | | | | | | 


VID_VAL | | 


VID_RDY | | 


VID_DATAI7:0] 


Figure 7-10 Video Port Timing 


Figure 7-11 DCLK Timing 
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Table 7-12 JTAG AC Specification 
Symbol Parameter Min Max Unit Notes 
TCK Frequency (MHz) 25 MHz 
t1 TCK Period 40 ns 
t2 TCK High Time 10 ns 
t3 TCK Low Time 10 ns 
t4 TCK Rise Time 4 ns 
t5 TCK Fall Time 4 ns 
t6 TDO Valid Delay 25 ns 
t7 Non-test Outputs Valid Delay 25 ns 
(8 TDO Float Delay 30 ns 
t9 Non-test Outputs Float Delay 36 ns 
t10 TDI, TMS Setup Time 8 ns 
11 Non-test Inputs Setup Time 8 ns 
t12 TDI, TMS Hold Time 7 ns 
t13 Non-test Inputs Hold Time 7 ns 
t1 
VIH(Min) 
1.5V 
ViL(Max) 
TCK 


Figure 7-12 TCK Timing and Measurement Points 
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TCK 


TDI, 
TMS 


TDO 


Output 
Signals 


Input 
Signals 


1.5V 


t10 t12 


Figure 7-13 JTAG Test Timings 
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8 Package Specifications 


The thermal characteristics and mechanical dimen- 
sions are provided on the following pages. 


8.1 Thermal Characteristics 

The MediaGX processor is designed to operate 
when the case temperature at the top center of the 
package is between 0°C and 70 or 85°C. The 
maximum die (junction) temperature and the 
maximum ambient temperature can be calculated 
by substituting thermal resistance and maximum 
values for case or junction temperature and power 
dissipation in the following equations: 


Ty = Tc + (P* 8Jc) 
Ty= Ta + (P* 8a) 
where: 
Ty, = Ambient temperature (°C) 
Ty = Average junction temperature (°C) 


= Case temperature at top center of package 
(°C) 


Integrated x86 Solution with MMX™ Support 


s 


P = Power dissipation (W) 
@jc = Junction-to-case thermal resistance 
(°C/W) 
8ja = Junction-to-ambient thermal resistance 
(°C/W). 
Oca = Case-to-ambient thermal resistance 
(°C/W). 
Therefore, this equation can be used to calculate 
the maximum 8ca value for the different ambient 
temperatures shown in Table 8-1 below: 


lot 
Rope 


The calculated 8c, value (examples shown in the 
Tables 8-1 and 8-2) represents the maximum spec- 
ification for the cooling solution chosen which is 
required to maintain the 70 or 85°C case tempera- 
ture for the application in which the device is used. 
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Table 8-1 Case to Ambient Thermal Resistance Examples for 70°C Product 


Oca for Different Ambient Temperatures (°C/W) 


Frequency Maximum 
Part Number (MHz) Power (W) 20°C 25°C 30°C 35°C 40°C 


Case Temperature = 70°C 


GM200 200 8.95 5.59 5.03 4.47 3.91 3.35 
GM233 233 9.87 5.07 4.56 4.05 3.55 3.04 
GM266 266 10.70 4.67 4.21 3.74 3.27 2.80 
GM300 300 11.27 4.44 3.99 3.55 3.11 2.66 


Table 8-2. Case to Ambient Thermal Resistance Examples for 85°C Product 


Oca for Different Ambient Temperatures (°C/W) 


Frequency Maximum 
Part Number (MHz) Power (W) 20°C 25°C 30°C 35°C 40°C 


Case Temperature = 85°C 


GM200 200 8.95 7.26 6.70 6.15 5.59 5.03 
GM233 233 9.87 6.59 6.08 5.57 5.07 4.56 
GM266 266 10.70 6.08 5.61 5.14 4.67 4.21 
GM300 300 11.27 5.77 5.32 4.88 4.44 3.99 
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8.2 Mechanical Package Outlines 


Dimensions for the BGA package are shown in sions. Table 8-3 gives the legend for the symbols 
Figure 8-1. Figure 8-2 shows the SPGA dimen- used in both package outlines. 
I 
< D > Seating |<—{qjaaa[Z 
c—— ———= — > Plane | 
Di sim |< Z f 
A 
ae Eee eee eee meer 
REF. goooeocooecgagegecaoecegeaoeegeeoenaeaeaeeag & 
y oooooogoeococococcae eg cao e Koo oe oO 8 o 
—— eood eooo 
A oooed ooosd o 
sae Sia 3 
eooodo eoooo —oa 
eoood oaoosd Ag 
eee oo eM 
eocoo eooco D a 
ee eee pwd 
oooo cooo°o Ao 
oooed oooso a 
ee eae J 
ooooocoegocgccocoogocoe cao aoe oO 8 al 
Me es eee b raceee es ee ee ad meee J 
15 oooooococeacgeoecegcgc ce eoeeaeeae se Vv 
Al < | 
AS 15 |< | <— 
<x> 
< > 
D2 
Millimeters Inches 


sym Min Max Min Max 


A | 145 | 2.23 | 0.057 | 0.088 | FOJ_ 
Ai | 050 | 0.70 | 0.020 | 0.028 
A2 | 0.43 | 0.83 | 0.017 | 0.033 7 
aaa 0.20 0.008 a — | _ CU Heat 


| Spreader 
B 0.60 | 0.90 | 0.024 | 0.035 
D | 34.80 | 35.20 | 1.370 | 1.386 
D1 | 31.55 | 31.95 | 1.242 | 1.258 
p2 | 32.80 | 35.20 | 1.291 | 1.386 
E1 1.12 | 1.42 | 0.044 | 0.056 
F 0.35 0.014 
St 1.42 | 1.82 | 0.056 | 0.072 y= fo Ines Ghamiet 


45 Degree Angle 


Figure 8-1 352-Terminal BGA Mechanical Package Outline 
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AINE > 
> Li< 
vv 
K 
A 
Fe ey 
a i 
BA 
Pe me 5 Be ko ae Oa Bo HOLS ens 
a 45 ‘CHAMFER 
2.29 REF, (INDEX CORNER) n ES ae 
< D > 
Millimeters Inches 
sym Min Max Min Max 
FL]. 
A 2.51 3.07 0.099 | 0.121 : 
B 0.43 0.51 0.017 | 0.020 
D 49.28 | 49.91 1.940 | 1.965 ee Abt indox riaile 
.030" blank circ 
D1 | 45.47 | 45.97 | 1.790 | 1.810 Wage bey el 
E1 2.41 2.67 0.095 | 0.105 / Sitcle to form donut 
E2 1.14 1.40 0.045 | 0.055 rs 
F -- 0.127 -- 0.005 / 
Diag Diag / 
L | 297 | 3.38 | 0.117 | 0.133 Pi 
S1 1.65 2.16 0.065 | 0.085 


Figure 8-2 320-Pin SPGA Mechanical Package Outline 
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Table 8-3 Mechanical Package Outline Legend 


Symbol Meaning 


A Distance from seating plane datum to highest point of body 
Al Solder ball height 

A2 Laminate thickness (excluding heat spreader) 
aaa Coplanarity 

B Pin or solder ball diameter 

D Largest overall package outline dimension 

D1 Length from outer pin center to outer pin center 

D2 Heat spreader outline dimension 

E1 BGA: Solder ball pitch 

SPGA: Linear spacing between true pin position centerlines 

E2 Diagonal spacing between true pin position centerlines 

F Flatness 


Distance from seating plane to tip of pin 


S1 Length from outer pin/ball center to edge of laminate 
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9 Instruction Set 
This section summarizes the MediaGX processor 4. There are no local bus HOLD requests 


instruction set and provides detailed information on 
the instruction encodings. The instruction set is 


broken into four categories: .. 
¢ Processor Core Instruction Set - listed in Table 
9-27 on page 246 
¢ FPU Instruction Set - listed in Table 9-29 on 
page 261 
* MMX™ Instruction Set - listed in Table 9-31 on 
page 267 
* Cyrix Extended MMxX Instruction Set - listed in % 


Table 9-33 on page 273 


These tables provide information on the instruction 8. 
encoding, and the instruction clock counts for each 
instruction. The clock count values for these tables 
are based on the assumptions following assumptions 


1. All clock counts refer to the internal micropro- _ 
cessor core internal clock frequency. For 
example, clock doubled MediaGX processor 
cores will reference a clock frequency that is 
twice the bus frequency. 
10 


2. The instruction has been prefetched, decoded 
and is ready for execution. 


3. Bus cycles do not require wait states. 


delaying processor access to the bus. 


No exceptions are detected during instruction 
execution. 


If an effective address is calculated, it does not 
use two general register components. One 
register, scaling and displacement can be used 
within the clock count shown. However, if the 
effective address calculation uses two general 
register components, add one clock to the 
clock count shown. 


All clock counts assume aligned 32-bit 
memory/IO operands. 


If instructions access a 32-bit operand on odd 
addresses, add one clock for read or write and 
add two clocks for read and write. 


For non-cached memory accesses, add two 
clocks (clock doubled MediaGX processor 
cores) or four clocks (clock tripled MediaGX 
processor cores), assuming zero wait state 
memory accesses. 


. Locked cycles are not cacheable. Therefore, 


using the LOCK prefix with an instruction adds 
additional clocks as specified in item 9 above. 
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9.1 General Instruction Set Format 


Depending on the instruction, the MediaGX 
processor core instructions follow the general 
instruction format shown in Table 9-1. 


address displacement, and immediate data. An 
instruction can be as short as one byte and as long 
as 15 bytes. If there are more than 15 bytes in the 
instruction, a general protection fault (error code 0) 
is generated. 


These instructions vary in length and can start at 


any byte address. An instruction consists of one or 
more bytes that can include prefix bytes, at least 
one opcode byte, a mod r/m byte, an s-i-b byte, 


The fields in the general instruction format at the 
byte level are summarized in Table 9-2 and 
detailed in the following subsections. 


Table 9-1 General Instruction Set Format 


Prefix (optional) 


Opcode mod reg r/m ss index | base 


Register and Address Mode Specifier 


mod r/m Byte s-i-b Byte 


Address Immediate 
Displacement Data 


0 or More Bytes 


1 or 2 Bytes 7:6 5:3 2:0 7:6 5:3 2:0 


0, 8, 16, or 32 Bits | 0, 8, 16, or 32 Bits 


Table 9-2 Instruction Fields 


Field Name 


Description 


Prefix (optional) 


Prefix Field(s): One or more optional fields that are used to specify segment register override, 
address and operand size, repeat elements in string instruction, LOCK# assertion. 


Opcode Opcode Field: Identifies instruction operation. 

mod Address Mode Specifier: Used with r/m field to select addressing mode. 

reg General Register Specifier: Uses reg, sreg3 or sreg2 encoding depending on opcode field. 
r/m Address Mode Specifier: Used with mod field to select addressing mode. 

ss Scale factor: Determines scaled-index address mode. 

index Index: Determines general register to be used as index register. 

base Base: Determines general register to be used as base register. 


Address Displacement 


Displacement: Determines address displacement. 


Immediate Data 


Immediate Data: Immediate-data operand used by instruction. 
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9.1.1 Prefix (Optional) 


Prefix bytes can be placed in front of any instruc- 
tion to modify the operation of that instruction. 
When more than one prefix is used, the order is not 
important. There are five types of prefixes that can 
be used: 


1. Segment Override explicitly specifies which 
segment register the instruction will use for 
effective address calculation. 


2. Address Size switches between 16-bit and 32- 
bit addressing by selecting the non-default 
address size. 


3. Operand Size switches between 16-bit and 32- 
bit operand size by selecting the non-default 
operand size. 


4. Repeat is used with a string instruction to 
cause the instruction to be repeated for each 
element of the string. 


5. Lock is used to assert the hardware LOCK# 
signal during execution of the instruction. 


Table 9-3 lists the encoding for different types of 
prefix bytes. 


9.1.2 Opcode 


The opcode field specifies the operation to be 
performed by the instruction. The opcode field is 
either one or two bytes in length and may be 
further defined by additional bits in the mod r/m 
byte. Some operations have more than one 
opcode, each specifying a different form of the 
operation. Certain opcodes name instruction 
groups. For example, opcode 80h names a group 
of operations that have an immediate operand and 
a register or memory operand. The reg field may 
appear in the second opcode byte or in the mod 
r/m byte. 


The opcode may contain w, d, s and eee opcode 
fields as shown in the Processor Core Instruction 
Set Summary (Table 9-27). 


Table 9-3 Instruction Prefix Summary 

Prefix Encoding | Description 

ES: 26h Override segment default, use 
ES for memory operand. 

CS: 2Eh Override segment default, use 
CS for memory operand. 

Ss: 36h Override segment default, use 
SS for memory operand. 

DS: 3Eh Override segment default, use 
DS for memory operand. 

FS: 64h Override segment default, use 
FS for memory operand. 

GS: 65h Override segment default, use 
GS for memory operand. 

Operand 66h Make operand size attribute the 

Size inverse of the default. 

Address 67h Make address size attribute the 

Size inverse of the default. 

LOCK FOh Assert LOCK# hardware signal. 

REPNE F2h Repeat the following string 
instruction. 

REP/REPE F3h Repeat the following string 
instruction. 


9.1.2.1 w Field (Operand Size) 


When used, the 1-bit w field selects the operand 
size during 16-bit and 32-bit data operations. See 
Table 9-4. 


Table 9-4 w Field Encoding 


Operand Size 
w 16-Bit Data 32-Bit Data 
Field Operations Operations 
0 8 bits 8 bits 
1 16 bits 32 bits 
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9.1.2.2 d Field (Operand Direction) 


When used, the d field (bit 1) determines which 
operand is taken as the source operand and which 


operand is taken as the destination. See Table 9-5. 


9.1.2.3 s Field (Immediate Data Field 
Size) 

When used, the s field (bit 1) determines the size of 

the immediate data field. If the s bit is set, the 


Table 9-5 d Field Encoding 


d Direction of Source Destination 
Field Operation Operand Operand 
0 Register-to-Register | reg mod r/m 
or or 
Register-to-Memory mod ss-index- 
base 
1 Register-to-Register | mod r/m reg 
or or 
Memory-to-Register | mod ss-index- 
base 


immediate field of the opcode is 8 bits wide and is 
sign-extended to match the operand size of the 


apodeme tate 20: Table 9-6 _s Field Encoding 


Immediate Field Size 


9.1.2.4 eee Field (MOV-Instruction : ; 
: : s 8-Bit 16-Bit 32-Bit 
Register Selection) Field Operand Size | Operand Size | Operand Size 
The eee field (bits [5:3]) is used to select the 0 (or not 8 bits 16 bits 32 bits 
control, debug and test registers in the MOV present) 
instructions. The type of register and base regis- { 8 bits 8 bits 8 bits 
ters selected by the eee field are listed in Table 9-7. (sign-extended) | (sign-extended) 


The values shown in Table 9-7 are the only valid 
encodings for the eee bits. 


Table 9-7 eee Field Encoding 


eee Field Register Type Base Register 
000 Control Register CRO 
010 Control Register CR2 
011 Control Register CR3 
100 Control Register CR4 
000 Debug Register DRO 
001 Debug Register DRI 
010 Debug Register DR2 
011 Debug Register DR3 
110 Debug Register DR6 
111 Debug Register DR7 
011 Test Register TR3 
100 Test Register TR4 
101 Test Register TR5 
110 Test Register TR6 
111 Test Register TR7 
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9.1.3 mod and r/m Byte Table 9-9 mod r/m Field Encoding 
(Memory Addressing) 32-Bit Address 
The mod and r/m fields within the mod r/m byte, Pera teers aise le smrieileny see 
select the type of memory addressing to be used. Field | Field mod rim Byte Bae Biseant 
Some instructions use a fixed addressing mode 
(e.g., PUSH or POP) and therefore, these fields are pe O00: Ps IB rel PSE 
not present. Table 9-9 lists the addressing method 00 | 001 | DS{BX+Dl] DS:[ECX] 
when 16-bit addressing is used and a mod r/m byte 00 010 | SS:[BP+SI] DS:[EDX] 
is present. Some mod r/m field encodings are 00 011 | SS:[BP+DI] DS:[EBX] 
dependent on the w field and are shown in Table 9- 00 100 | DS{SI] s-i-b is present 
8. (See Table 9-15) 
00 101 | DSDI] DS:[d32] 
Table 9-8 General Registers Selected by mod = me polar pial 
r/m Fields and w Field pee Bele pode 
16-bit 32-Bit 01 000 | DS[BX+Sl+d8] DS[EAX+d8] 
Operation Operation 
01 001 | DS:[BX+DI+d8] DS:[ECX+d8] 
mod || Oe as |e | 01 | 010 | SS{BP+SI+d8] DS:[EDX+d8] 
11 000 AL AX AL EAX 01 011 | SS:[BP+DI+d8] DS:[EBX+d8] 
11 001 CL CX CL ECX 01 100 | DS{[SI+d8] s-i-b is present 
1 010 DL DX DL EDX (See Table 9-15) 
11 011 BL BX BL EBX 01 101 DS:[Dl+d8] SS:[EBP+d8] 
11 100 AH sP AH ESP 01 110 | SS:[BP+d8] DS:[ESI+d8] 
11 101 CH BP CH EBP 01 111 DS:[BX+d8] DS:[EDI+d8] 
11 110 DH S| DH ESI 
11 111 BH DI BH EDI 10 000 | DS:[BX+SI+d16] DS:[EAX+d32] 
10 001 | DS:[BX+Dl+d16] DS:[ECX+d32] 
10 010 | SS[BP+Sl+d16] DS:[EDX+d32] 
10 011 | SS:[BP+Dl+d16] DS:[EBX+d32] 
10 100 | DS:[Sl+d16] s-i-b is present 
(See Table 9-15) 
10 101 | DS[Dl+d16] SS[EBP+d32] 
10 110 | SS:[BP+d16] DS[ESl+d32] 
10 111 | DS:[BX+d16] DS:[EDI+d32] 
11 xxx | See Table 9-8. See Table 9-8 


Note: Note: d8 refers to 8-bit displacement, and d16 refers to 
16-bit displacement. 
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9.1.4 reg Field Table 9-10 General Registers Selected by reg 
The reg field (Table 9-10) determines which Field 
general registers are to be used. The selected 16-Bit Operation 32-Bit Operation 
register is dependent on whether a 16- or 32-bit ii esti a eon aa 
operation is current and on the status of the w bit. 
000 AL AX AL EAX 
001 CL CX CL ECX 
9.1.4.1 sreg2 Field (ES, CS, SS, DS 010 DL DX DL EDX 
Register Selection) 011 BL BX BL EBX 
The sreg2 field (Table 9-11) is a 2-bit field that 100 AH SP AH ESP 
allows one of the four 286-type segment registers 101 CH BP CH EBP 
to be specified. 110 DH Sl DH ESI 
111 BH DI BH EDI 


9.1.4.2 sreg3 Field (FS and GS 
Segment Register Selection) 


Table 9-11. sreg2 Field Encodi 
The sreg3 field (Table 9-12) is 3-bit field that is able ft plede He cencedig 


similar to the sreg2 field, but allows use of the FS sreg2 Field Segment Register Selected 
and GS segment registers. 00 ES 

01 cs 

10 ss 

11 Ds 


Table 9-12 sreg3 Field Encoding 


sreg3 Field Segment Register Selected 
000 ES 
001 cs 
010 Ss 
011 DS 
100 FS 
101 GS 
110 Undefined 
111 Undefined 
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9.1.5 s-i-b Byte (Scale, Indexing, 


Base) 
The s-i-b fields provide scale factor, indexing and a 
base field for address selection. The ss, index and 
base fields described next. 


9.1.5.1 ss Field (Scale Selection) 


The ss field (Table 9-13) specifies the scale factor 
used in the offset mechanism for address calcula- 
tion. The scale factor multiplies the index value to 
provide one of the components used to calculate 
the offset address. 


Table 9-13 ss Field Encoding 


ss Field Scale Factor 
00 x1 
01 x2 
01 x4 
11 x8 


9.1.5.2 index Field (Index Selection) 


The index field (Table 9-14) specifies the index 
register used by the offset mechanism for offset 
address calculation. When no index register is 
used (index field = 100), the ss value must be 00 or 
the effective address is undefined. 


Table 9-14 index Field Encoding 


Index Field Index Register 
000 EAX 
001 ECX 
010 EDX 
011 EBX 
100 none 
101 EBP 
110 ESI 
111 EDI 


9.1.5.3 Base Field (s-i-b Present) 


In Table 9-9, the note “s-i-b present” for certain 
entries forces the use of the mod and base field as 
listed in Table 9-15. The first two digits in the first 
column of Table 9-15 identifies the mod bits in the 
mod r/m byte. The last three digits in the first 
column of this table identifies the base fields in the 
s-i-b byte. 


Table 9-15 mod base Field Encoding 


mod Field base Field 
within within 
mode/rm s-i-b 32-Bit Address Mode 
Byte Byte with mod r/m and s-i-b 

(bits 7:6) (bits 2:0) Bytes Present 
00 000 DS:[EAX+(scaled index)] 
00 001 DS:[ECX+(scaled index)] 
00 010 DS:[EDX+(scaled index)] 
00 011 DS:[EBX+(scaled index)] 
00 100 SS:[ESP+(scaled index)] 
00 101 DS:[d32+(scaled index)] 
00 110 DS:[ESI+(scaled index)] 
00 111 DS:[EDI+(scaled index)] 
01 000 DS:[EAX+(scaled index)+d8] 
01 001 DS:[ECX+(scaled index)+d8] 
01 010 DS:[EDX+(scaled index)+d8] 
01 011 DS:[EBX+(scaled index)+d8] 
01 100 SS:[ESP+(scaled index)+d8] 
01 101 SS:[EBP+(scaled index)+d8] 
01 110 DS:[ESI+(scaled index)+d8] 
01 111 DS:[EDI+(scaled index)+d8] 
10 000 DS:[EAX+(scaled index)+d32] 
10 001 DS:[ECX+(scaled index)+d32] 
10 010 DS:[EDX+(scaled index)+d32] 
10 011 DS:[EBX+(scaled index)+d32] 
10 100 SS:[ESP+(scaled index)+d32] 
10 101 SS:[EBP+(scaled index)+d32] 
10 110 DS:[ESI+(scaled index)+d32] 
10 111 DS:[EDI+(scaled index)+d32] 
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9.2 CPUID Instruction Table 9-16 CPUID Levels Summary 

The CPUID instruction (opcode OFA2) allows the Initialized 

software to make processor inquiries as to the CPUID EAX Returned Data in EAX, EBX, 
vendor, family, model, stepping, features and also Type Register ECX, EDX Registers 
provides cache information. The MediaGX with Standard | 0000 0000h | Maximum standard levels, CPU 
MMxX supports both the standard and Cyrix vendor string 

extended CPUID levels. Standard | 0000 0001h | Model, family, type and features 


Standard | 0000 0002h | TLB and cache information 
Extended | 8000 0000h | Maximum extended levels 
Extended | 8000 0001h | Extended model, family, type and 


The presence of the CPUID instruction is indicated 
by ability to change the value of the ID Flag, bit 21 
in the EFLAGS register. 


features 
The CPUID level allows the CPUID instruction to Bxtsnde da) BOVE ON02I aoe Wunlarerng arene sting 
return different information in EAX, EBX, ECX, Extended | 8000 0003h 
EDX, registers. The level is determined by the Extended | 8000 0004h 
initialized value of the EAX register before the Extended | 8000 0005h | TLB and L1 cache description 


instruction is executed. A summary of the CPUID 
levels is shown in Table 9-16. 
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9.2.1 Standard CPUID Levels 


The standard CPUID levels are part of the stan- 
dard x86 instruction set. 


9.2.1.1 CPUID Instruction with 
EAX = 0000 0000h 


Standard function Oh (EAX = 0) of the CPUID 
instruction returns the maximum standard CPUID 
levels as well as the processor vendor string. 


After the instruction is executed, the EAX register 
contains the maximum standard CPUID levels 
supported. The maximum standard CPUID level is 
the highest acceptable value for the EAX register 
input. This does not include the extended CPUID 
levels. 


The EBX through EDX registers contain the vendor 
string of the processor as shown in Table 9-17. 


Table 9-17 CPUID Data Returned when EAX = 0 


Register 
(Note) Returned Contents Description 
EAX 2 Maximum Standard 


Level 


EBX 69 72 79 43. (iryC) | Vendor ID String 1 


EDX 73 6E 49 78 (snix) | Vendor ID String 2 


ECX 64 61 65 74 (daet) | Vendor ID String 3 


Note: The register column is intentionally out of order. 


9.2.1.2 CPUID Instruction with 
EAX = 0000 0001h 


Standard function 01h (EAX = 1) of the CPUID 
instruction returns the processor type, family, 
model, and stepping information of the current 
processor in the EAX register (see Table 9-18). 
The EBX and ECX registers are reserved. 


Table 9-18 EAX, EBX, ECX CPUID Data 
Returned when EAX = 1 


Returned 

Register Contents Description 
EAX[3:0] XX Stepping ID 
EAX[7:4] 4 Model 
EAX[11:8] 5 Family 
EAX[15:12] 0 Type 
EAX[31:16] Reserved 
EBX - Reserved 
ECX > Reserved 


The standard feature flags supported are returned 
in the EDX register as shown in Table 9-19. Each 
flag refers to a specific feature and indicates if that 
feature is present on the processor. Some of these 
features have protection control in CR4. Before 
using any of these features on the processor, the 
software should check the corresponding feature 
flag. Attempting to execute an unavailable feature 
can cause exceptions and unexpected behavior. 
For example, software must check bit 4 before 
attempting to use the Time Stamp Counter instruc- 
tion. 


GXm_db_v2.0 


Cyrix Corporation Confidential 


Page 241 


‘MediafOx 


MM X" - 


PROCESSOR 


EN H AN ¢€ 


Table 9-19 EDX CPUID Data 
Returned when EAX = 1 


Returned CR4 
EDX Contents* Feature Flag Bit 

EDX[0] 1 FPU On-Chip - 

EDX{(1] 0 Virtual Mode Extension 0,1 

EDX[2] 0 Debug Extensions 3 

EDX{[3] 0 Page Size Extensions 4 

EDX[4] 1 Time Stamp Counter 2 

EDX[5] 1 RDMSR / WRMSR 8 
Instructions 

EDX[6] 0 Physical Address 5 
Extensions 

EDX[7] 0 Machine Check Exception 6 

EDX[8] 1 CMPXCHG8B Instruction - 

EDX{[9] 0 On-Chip APIC Hardware = 

EDX[10] 0 Reserved - 

EDX[11] 0 SYSENTER / SYSEXIT - 
Instructions 

EDX[12] 0 Memory Type Range - 
Registers 

EDX[13] 0 Page Global Enable 7 

EDX[14] 0 Machine Check = 
Architecture 

EDX[15] 1 Conditional Move 
Instructions 

EDX[16] 0 Page Attribute Table ' 

EDX[22:17] 0 Reserved - 

EDX[23] 1 MMX™ Instructions - 

EDX[24] 0 Fast FPU Save and - 
Restore 

EDX[81:25] 0 Reserved - 

Note: *0 = Not supported 


CPUID Instruction 


9.2.1.3 CPUID Instruction with 


EAX = 0000 0002h 


Standard function 02h (EAX = 02h) of the CPUID 
instruction returns information that is specific to the 
Cyrix family of processors. Information about the 
TLB is returned in EAX as shown in Table 9-20. 
Information about the L1 cache is returned in EDX. 


Table 9-20 Standard CPUID with 
EAX = 0000 0002h 


Returned 
Register Contents Description 

EAX xx xx 70 xxh | TLB is 32 Entry, 4-way set asso- 
ciative, and has 4 KByte Pages 

EAX xx xx xx 01h | The CPUID instruction needs to 
be executed only once with an 
input value of 02h to retrieve 
complete information about the 
cache and TLB 

EBX Reserved 

ECX Reserved 

EDX xx xx xx 80h | L1 cache is 16 KBytes, 4-way 


set associated, and has 16 bytes 
per line. 
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9.2.2 Extended CPUID Levels Table 9-22 EAX, EBX, ECX CPUID Data 
Testing for extended CPUID instruction support Returned when EAX = 8000 0001h 
can be accomplished by executing a CPUID Returned 
instruction with the EAX register initialized to Register Contents Description 
8000 O0000h. If a value greater than or equal to EAX[3:0] Xx Stepping ID 
8000 0000h is returned to the EAX register by the EAX(7:4] 4 Model 
CPUID instruction, the processor supports EAX([11:8] 5 Family 
extended CPUID levels. EAX[15:12] 0 Processor Type 
EAX[31:16] = Reserved 

9.2.2.1 CPUID Instruction with EBs Resolved 

EAX = 8000 0000h ais : Reseed 


Extended function 8000 0000h (EAX = 
8000 0000h) of the CPUID instruction returns the 
maximum extended CPUID levels supported by the | Table 9-23 EDX CPUID Data Returned 


current processor in EAX (Table 9-21). The EBX, when EAX = 8000 0001h 
ECX, and EDX registers are currently reserved. Returned CR4 
EDX Contents* Feature Flag Bit 
Table 9-21 Maximum Extended CPUID Level EDX(0] 1 FPU On-Chip 
Returned EDX[1] 0 Virtual Mode Extension 0,1 
Register Contents Description EDX[2] 0 Debugging Extension 3 
EAX 8000 0005h_ | Maximum Extended CPUID EPA? p ee Size :Exension : 
Level (six levels) (4M) 
EBX 7 Reserved EDX[4] 1 Time Stamp Counter 2 
z EDX[5] 1 Cyrix Model-Specific Reg- 8 
shi Beseived isters (via RDMSR / 
EDX : Reserved WRMSR Instructions) 
EDX[6] 0 Reserved 5 
9.2.2.2 CPUID Instruction with EDX[7] 0 Machine Check Exception 6 
EAX _ 8000 0001 h EDX[8] 1 CMPXCHG8B Instruction 
cag EDX(9] 0 Reserved 
Extended function 8000 0001h (EAX = ; 
: ; EDX[10] 0 Reserved 
8000 0001h) of the CPUID instruction returns the 
; EDX[11] 0 SYSCALL / SYSRET 
processor type, family, model, and stepping naaeion 
information of the current processor in EAX. The EDX(12] 6 acawee = 
EBX and ECX registers are reserved. 
EDX[13] 0 Page Global Enable 7 
The extended feature flags supported are returned EDX[14] 0 Reserved 
in the EDX register as shown in Table 9-23. Each EDX[15] 1 Integer Conditional Move 
flag refers to a specific feature and indicates if that Instruction 
feature is present on the processor. Some of these EDX[16] ) FPU Conditional Move 
features have protection control in CR4. Before Instruction 
using any of these features on the processor, the EDX[22:17] 0 Reserved 
software should check the corresponding feature EDX[23] 1 MMX™ 
flag. EDX[24] 1 6x86MX Multimedia 
Extensions 


Note: 0 = Not supported 
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9.2.2.3, CPUID Instruction with 
EAX = 8000 0002h, 8000 0003h, 
8000 0004h 


Extended functions 8000 0002h through 8000 
0004h (EAX = 8000 0002h, EAX = 8000 0003h, 
EAX = 8000 0004h) of the CPUID instruction 
returns an ASCII string containing the name of the 
current processor. These functions eliminate the 
need to look up the processor name in a lookup 
table. Software can simply call these functions to 
obtain the name of the processor. The string may 
be 48 ASCII characters long, and is returned in 
little endian format. If the name is shorter than 48 
characters long, the remaining bytes will be filled 
with ASCII NUL character (00h). 


Table 9-24 Official CPU Name 


8000 0002h 8000 0003h 8000 0004h 
EAX | CPU EAX | CPU EAX | CPU 
Name 1 Name 5 Name 9 
EBX | CPU EBX | CPU EBX | CPU 
Name 2 Name 6 Name 10 
ECX | CPU ECX | CPU ECX | CPU 
Name 3 Name 7 Name 11 
EDX | CPU EDX | CPU EDX | CPU 
Name 4 Name 8 Name 12 


9.2.2.4 CPUID Instruction with 

EAX = 8000 0005h 
Extended function 8000 0005h (EAX = 8000 
0005h) of the CPUID instruction returns informa- 
tion about the TLB and L1 cache to be looked up in 
a lookup table. Refer to Table 9-25. 


Table 9-25 Standard CPUID with 
EAX = 8000 0005h 


Returned 
Register Contents Description 

EAX -- Reserved 

EBX xx xx 70 xxh_ | TLB is 32 Entry, 4-way set 
associative, and has 4 KByte 
Pages 

EBX xx xx xx O1h | The CPUID instruction needs 
to be executed only once with 
an input value of 02h to 
retrieve complete information 
about the cache and TLB 

ECX xx xx xx 80h | L1 cache is 16 KBytes, 4-way 
set associated, and has 16 
bytes per line. 

EDX -- Reserved 
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9.3 Processor Core Instruction Set 


The instruction set for the MediaGX processor core 
is summarized in Table 9-27. The table uses 
several symbols and abbreviations that are 
described next and listed in Table 9-26. 


Opcodes 


Opcodes are given as hex values except when 
they appear within brackets as binary values. 


Clock Counts 


The clock counts listed in the instruction set 
summary table are grouped by operating mode 
(Real and Protected) and whether there is a 
register/cache hit or a cache miss. In some cases, 
more than one clock count is shown in a column for 
a given instruction, or a variable is used in the 
clock count. 


Flags 


There are nine flags that are affected by the execu- 
tion of instructions. The flag names have been abbrevi- 
ated and various conventions used to indicate what 
effect the instruction has on the particular flag. 


GXm_db_v2.0 


Table 9-26 Processor Core Instruction Set 
Table Legend 
Symbol or 
Abbreviation Description 
Opcode 
# Immediate 8-bit data 
## Immediate 16-bit data 
HHH Full immediate 32-bit data (8, 16, 32 bits) 
+ 8-bit signed displacement 
+++ Full signed displacement (16, 32 bits) 
Clock Count 
/ Register operand/memory operand. 
n Number of times operation is repeated. 
L Level of the stack frame. 
| Conditional jump taken | Conditional jump not 
taken. 
(e.g. “4|1” = 4 clocks if jump taken, 1 clock if 
jump not taken) 
\ CPL < IOPL \ CPL > IOPL 
(where CPL = Current Privilege Level, IOPL 
= I/O Privilege Level) 
Flags 
OF Overflow Flag 
DF Direction Flag 
IF Interrupt Enable Flag 
TF Trap Flag 
SF Sign Flag 
ZF Zero Flag 
AF Auxiliary Flag 
PF Parity Flag 
CF Carry Flag 
Xx Flag is modified by the instruction. 
Flag is not changed by the instruction. 
0 Flag is reset to “0”. 
1 Flag is set to “1”. 


Flag is undefined following execution the 
instruction. 
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Table 9-27 Processor Core Instruction Set Summary 


Real | Prot’d | Real | Prot’d 
Flags Mode | Mode | Mode | Mode 
oD T S Z A P C} Clock Count 
Instruction Opcode FF F F F F F F | (Reg/Cache Hit) Notes 
AAA ASCII Adjust AL after Add 37 ue - uu xX _U X 3 3 
AAD ASCII Adjust AX before Divide D5 0A Up - X X UX _U 7 7 
AAM ASCII Adjust AX after Multiply D4 0A u - - x x ux ul 19 19 
AAS ASCIl Adjust AL after Subtract 3F u - 20 Ue Xk WX 3 3 
ADC Add with Carry 
Register to Register 1 [00dw] [11 reg r/m] X + ~ xX X xX xX Xx 1 1 b h 
Register to Memory 1 [000w] [mod reg r/m] 1 1 
Memory to Register 1 [001w] [mod reg r/m] 1 1 
Immediate to Register/Memory 8 [O0sw] [mod 010 r/m]### 1 1 
Immediate to Accumulator 1 [010w] ### 1 1 
ADD Integer Add 
Register to Register 0 [00dw] [11 reg r/m] xX + ~ xk * xX xX Xx 1 1 b h 
Register to Memory 0 [000w] [mod reg r/m] 1 1 
Memory to Register 0 [001w] [mod reg r/m] 1 1 
Immediate to Register/Memory 8 [O0sw] [mod 000 r/m]### 1 1 
Immediate to Accumulator 0 [010w] ### 1 1 
AND Boolean AND 
Register to Register 2 [00dw] [11 reg r/m] 0 - - xX x ux 0 1 1 b h 
Register to Memory 2 [000w] [mod reg r/m] 1 1 
Memory to Register 2 [001w] [mod reg r/m] 1 1 
Immediate to Register/Memory 8 [O0sw] [mod 100 r/m]### 1 1 
Immediate to Accumulator 2 [010w] ### 1 1 
ARPL Adjust Requested Privilege Level 
From Register/Memory 63 [mod reg r/m] - a 9 a h 
BBO_Reset Set BLT Buffer 0 Pointer to the Base |OF 3A 2 2 
BB1_Reset Set BLT Buffer 1 Pointer to the Base |OF 3B 2 2 
BOUND Check Array Boundaries 
If Out of Range (Int 5) 62 [mod reg r/m] - - - - = = = = | 8+INT | 8+INT | b,e | g,h,j,kr 
If In Range fs fs 
BSF Scan Bit Forward 
Register, Register/Memory OF BC [mod reg r/m] ~ 4 - = X - - = | 4/9+n | 4/9+n b h 
BSR Scan Bit Reverse 
Register, Register/Memory OF BD [mod reg r/m] ~ - = xX = = = | 4/11+n | 4/11+Nn b h 
BSWAP Byte Swap OF C[1 reg] - - - oe oe eee 6 6 
BT Test Bit 
Register/Memory, Immediate OF BA [mod 100 r/m]# - ee ¢ 1 1 b h 
Register/Memory, Register OF A3 [mod reg r/m] 1/7 1/7 
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Table 9-27 Processor Core Instruction Set Summary (cont.) 


Real | Prot’d | Real | Prot’d 


Flags Mode | Mode | Mode} Mode 
ODI TS ZA P Cy Clock Count 
Instruction Opcode F F F F F F F F F |(Reg/Cache Hit) Notes 
BTC Test Bit and Complement 
Register/Memory, Immediate OF BA [mod 111 r/m]# 4 2 2 b h 
Register/Memory, Register OF BB [mod reg r/m] 2/8 2/8 
BTR Test Bit and Reset 
Register/Memory, Immediate OF BA [mod 110 r/m]# Se 2 2 b h 
Register/Memory, Register OF B3 [mod reg r/m 2/8 2/8 
BTS Test Bit and Set 
Register/Memory OF BA [mod 101 r/m] me mmm 2 2 b h 
Register (short form) OF AB [mod reg r/m] 2/8 2/8 


CALL Subroutine Call 


Direct Within Segment E8 +++ ee ee ea 3 3 b h,j.k,r 
Register/Memory Indirect Within Segment FF [mod 010 r/m] 3/4 3/4 
Direct Intersegment 9A [unsigned full offset, 9 14 
-Call Gate to Same Privilege selector] 24 
-Call Gate to Different Privilege No Par’s 45 
-Call Gate to Different Privilege m Par’s 51+2m 
-16-bit Task to 16-bit TSS 183 
-16-bit Task to 32-bit TSS 189 
-16-bit Task to V86 Task 123 
-32-bit Task to 16-bit TSS 186 
-32-bit Task to 32-bit TSS 192 
-32-bit Task to V86 Task 126 
Indirect Intersegment FF [mod 011 r/m] 11 15 
-Call Gate to Same Privilege 25 
-Call Gate to Different Privilege No Par’s 46 
-Call Gate to Different Privilege m Par’s 52+2m 
-16-bit Task to 16-bit TSS 184 
-16-bit Task to 32-bit TSS 190 
-16-bit Task to V86 Task 124 
-32-bit Task to 16-bit TSS 187 
-32-bit Task to 32-bit TSS 193 
-32-bit Task to V86 Task 127 
CBW Convert Byte to Word 98 SOS fael oe) oS. 
CDQ Convert Doubleword to Quadword 99 - oe eee eee 2 2 
CLC Clear Carry Flag F8 ee 0) 1 1 
CLD Clear Direction Flag FC ~ Q@- =~ 2 oe ew ee 4 4 
CLI Clear Interrupt Flag FA - - O - - - - = - 6 6 m 
CLTS Clear Task Switched Flag OF 06 7 7 c | 
CMC Complement the Carry Flag F5 4 3 3 


CMOVA/CMOVNBE Move if Above/Not Below or Equal 


Register, Register/Memory OF 47 [mod reg r/m] mm me 1 1 r 
CMOVBE/CMOVNA Move if Below or Equal/Not Above 

Register, Register/Memory OF 46 [mod reg r/m] mm ms 1 1 r 
CMOVAE/CMOVNB/CMOVNC Move if Above or Equal/Not Below/Not Carry 

Register, Register/Memory OF 43 [mod reg r/m] me 1 1 ig 
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Table 9-27 Processor Core Instruction Set Summary (cont.) 


Real | Prot’d | Real | Prot’d 
Flags Mode | Mode | Mode | Mode 
OD TSZA Clock Count 
Instruction Opcode F F F F F F (Reg/Cache Hit) Notes 

CMOVB/CMOVC/CMOVNAE Move if Below/Carry/Not Above or Equal 

Register, Register/Memory OF 42 [mod reg r/m] ~ 8 see 1 1 r 
CMOVE/CMOVZ Move if Equal/Zero 

Register, Register/Memory OF 44 [mod reg r/m] os a mom 1 1 r 
CMOVNE/CMOVNZ Move if Not Equal/Not Zero 

Register, Register/Memory OF 45 [mod reg r/m] mos a se 1 1 r 
CMOVG/CMOVNLE Move if Greater/Not Less or Equal 

Register, Register/Memory OF 4F [mod reg r/m] os some 1 1 r 
CMOVLE/CMOVNG Move if Less or Equal/Not Greater 

Register, Register/Memory OF 4E [mod reg r/m] mos a ome 1 1 r 
CMOVL/CMOVNGE Move if Less/Not Greater or Equal 

Register, Register/Memory OF 4C [mod reg r/m] mos a om 1 1 r 
CMOVGE/CMOVNL Move if Greater or Equal/Not Less 

Register, Register/Memory OF 4D [mod reg r/m] mos oe 1 1 r 
CMOVO Move if Overflow 

Register, Register/Memory OF 40 [mod reg r/m] = oe 1 1 f 
CMOVNO Move if No Overflow 

Register, Register/Memory OF 41 [mod reg r/m] so oe 1 1 r 
CMOVP/CMOVPE Move if Parity/Parity Even 

Register, Register/Memory OF 4A [mod reg r/m] = 4 ooo 1 1 r 
CMOVNP/CMOVPO Move if Not Parity/Parity Odd 

Register, Register/Memory OF 4B [mod reg r/m] = 8 oe 1 1 r 
CMOVS Move if Sign 

Register, Register/Memory OF 48 [mod reg r/m] = 8 ote 1 7 4 
CMOVNS Move if Not Sign 

Register, Register/Memory OF 49 [mod reg r/m] =e eo 1 1 r 
CMP Compare Integers 

Register to Register 3 [10dw] [11 reg r/m] X o> ~ xX xX Xx 1 1 b h 

Register to Memory 3 [101w] [mod reg r/m] 1 1 

Memory to Register 3 [100w] [mod reg r/m] 1 1 

Immediate to Register/Memory 8 [O0sw] [mod 111 r/m] ### 1 1 

Immediate to Accumulator 3 [110w] ### 1 1 
CMPS Compare String A [011 w] x - - xX X X 6 6 b h 
CMPXCHG Compare and Exchange 

Register1, Register2 OF B [000w] [11 reg2 reg1] X - - xX X X 6 6 

Memory, Register OF B [000w] [mod reg r/m] 6 6 
CMPXCHG8B Compare and Exchange 8 Bytes |OF C7 [mod 001 r/m] - - - oe oe 
CPUID CPU Identification OF A2 - - - oe ee 12 12 
CPU_READ Read Special CPU Register OF 3C 1 1 
CPU_WRITE Write Special CPU Register OF 3D 1 1 
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Table 9-27 Processor Core Instruction Set Summary (cont.) 


Real | Prot’d | Real | Prot’d 
Flags Mode | Mode | Mode | Mode 
: ODI TSZAP Clock Count 
Instruction Opcode PoP OF PUP OF FS (Reg/Cache Hit) Notes 
CWD Convert Word to Doubleword 99 yee ee eee 2 2 
CWDE Convert Word to Doubleword Extended 98 _— se eee ee 3 3 
DAA Decimal Adjust AL after Add 27 aa oR gee KE KE OK 2 2 
DAS Decimal Adjust AL after Subtract 2F - = = = X xX X xX 2 2 
DEC Decrement by 1 
Register/Memory F [111w] [mod 001 r/m] X = = =» KX KX KX 1 1 b h 
Register (short form) 4[1 reg] 1 1 
DIV Unsigned Divide 
Accumulator by Register/Memory F [011w] [mod 110 r/m] - - = = X xX UU b,e e,h 
Divisor: Byte 20 20 
Word 29 29 
Doubleword 45 45 
ENTER Enter New Stack Frame 
Level = 0 C8 ##.# ee ee eee 13 13 b h 
Level = 1 17 17 
Level (L) > 1 17+2*L | 17+2*L 
HLT Halt F4 ee ee eee 10 10 
IDIV Integer (Signed) Divide 
Accumulator by Register/Memory F [011w] [mod 111 r/m] - - = = X xX UU b,e e,h 
Divisor: Byte 20 20 
Word 29 29 
Doubleword 45 45 
IMUL /nteger (Signed) Multiply 
Accumulator by Register/Memory F [011w] [mod 101 r/m] X - - = X X UU b h 
Multiplier: Byte 4 4 
Word 5 5 
Doubleword 15 15 
Register with Register/Memory OF AF [mod reg r/m] 
Multiplier: Word 5 5 
Doubleword 18 15 
Register/Memory with Immediate to Register2 | 6 [10s1] [mod reg r/m] ### 
Multiplier: Word 6 6 
Doubleword 16 16 
IN Input from I/O Port 
Fixed Port E [010w] # eee ee eee 8 8/22 m 
Variable Port E [110w] 8 8/22 
INS Input String from I/O Port 6 [110w] eee Fe aE eS 11 11/25 b h,m 
INC Increment by 1 
Register/Memory F [111w] [mod 000 r/m] X > = = KX KX KX 1 1 b h 
Register (short form) 4 [0 reg] 1 1 
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Table 9-27 Processor Core Instruction Set Summary (cont.) 


Real | Prot’d | Real | Prot’d 


Flags Mode | Mode | Mode | Mode 
: ODI TS ZA P Cy Clock Count 
Instruction Opcode F F F F F F F F F |(Reg/Cache Hit) Notes 
INT Software Interrupt 
INT i CD# - - x O- - - - - 19 b,e g,j,k,r 
Protected Mode: 
-Interrupt or Trap to Same Privilege 33 
-Interrupt or Trap to Different Privilege 55 
-16-bit Task to 16-bit TSS by Task Gate 184 
-16-bit Task to 32-bit TSS by Task Gate 190 
-16-bit Task to V86 by Task Gate 124 
-16-bit Task to 16-bit TSS by Task Gate 187 
-32-bit Task to 32-bit TSS by Task Gate 193 
-32-bit Task to V86 by Task Gate 127 
-V86 to 16-bit TSS by Task Gate 187 
-V86 to 32-bit TSS by Task Gate 193 
-V86 to Privilege 0 by Trap Gate/Int Gate 64 
INT 3 CC INT INT 
INTO CE 
If OF==0 4 4 
If OF==1 (INT 4) INT INT 
INVD /nvalidate Cache OF 08 Bee fee Gh dime eigen = 20 20 t t 
INVLPG /nvalidate TLB Entry OF 01 [mod 111 r/m] ee ee ee eee 15 15 
IRET /nterrupt Return 
Real Mode CF x x KX xX X X X KX X 13 g,h,j,k,r 
Protected Mode: 
-Within Task to Same Privilege 20 
-Within Task to Different Privilege 39 
-16-bit Task to 16-bit Task 169 
-16-bit Task to 32-bit TSS 175 
-16-bit Task to V86 Task 109 
-32-bit Task to 16-bit TSS 172 
-32-bit Task to 32-bit TSS 178 
-32-bit Task to V86 Task 112 


JB/JNAE/JC Jump on Below/Not Above or Equal/Carry 


8-bit Displacement 72+ 1 1 r 
Full Displacement OF 82 +++ 1 1 
JBE/JNA Jump on Below or Equal/Not Above 
8-bit Displacement 76+ eee eee 1 1 r 
Full Displacement OF 86 +++ 1 1 
JCXZ/JECXZ Jump on CX/ECX Zero E3 + Se 2 2 r 
JE/JZ Jump on Equal/Zero 
8-bit Displacement 744 eee eee 1 1 r 
Full Displacement OF 84 +++ 1 1 
JL/JNGE Jump on Less/Not Greater or Equal 
8-bit Displacement 70+ eee eee 1 1 r 
Full Displacement OF 8C +++ 1 1 
JLE/JNG Jump on Less or Equal/Not Greater 
8-bit Displacement 7E+ eee ee eee 1 1 r 
Full Displacement OF 8E +++ 1 1 
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Table 9-27 Processor Core Instruction Set Summary (cont.) 


Real | Prot’d | Real | Prot’d 
Flags Mode | Mode | Mode | Mode 
: ODI TS ZA P Cy Clock Count 
Instruction Opcode F F F F F F F F F |(Reg/Cache Hit) Notes 
JMP Unconditional Jump 
8-bit Displacement EB + eee ee eee 1 1 b h,j,.k,r 
Full Displacement EQ +++ 1 1 
Register/Memory Indirect Within Segment FF [mod 100 r/m] 1/3 1/3 
Direct Intersegment EA [unsigned full offset, 8 12 
-Call Gate Same Privilege Level selector] 22 
-16-bit Task to 16-bit TSS 186 
-16-bit Task to 32-bit TSS 192 
-16-bit Task to V86 Task 126 
-32-bit Task to 16-bit TSS 189 
-32-bit Task to 32-bit TSS 195 
-32-bit Task to V86 Task 129 
Indirect Intersegment FF [mod 101 r/m] 10 13 
-Call Gate Same Privilege Level 23 
-16-bit Task to 16-bit TSS 187 
-16-bit Task to 32-bit TSS 193 
-16-bit Task to V86 Task 127 
-32-bit Task to 16-bit TSS 190 
-32-bit Task to 32-bit TSS 196 
-32-bit Task to V86 Task 130 
JNB/JAE/JNC Jump on Not Below/Above or Equal/Not Carry 
8-bit Displacement 73 + eee eee 1 1 r 
Full Displacemen OF 83 +++ 1 1 
JNBE/JA Jump on Not Below or Equal/Above 
8-bit Displacement 77+ ee eee eee 1 1 r 
Full Displacemen OF 87 +++ 1 1 
JNE/JNZ Jump on Not Equal/Not Zero 
8-bit Displacement 75+ ee eee eee 1 1 r 
Full Displacemen OF 85 +++ 1 1 
JNL/JGE Jump on Not Less/Greater or Equal 
8-bit Displacement 7D + me 1 1 r 
Full Displacement OF 8D +++ 1 1 
JNLE/JG Jump on Not Less or Equal/Greater 
8-bit Displacement 7F + eee ee eee 1 1 r 
Full Displacement OF 8F +++ 1 1 
JNO Jump on Not Overflow 
8-bit Displacement 71+ Se 1 1 r 
Full Displacement OF 81 +++ 1 1 
JNP/JPO Jump on Not Parity/Parity Odd 
8-bit Displacement 7B+ ~ Se eee 1 1 r 
Full Displacement OF 8B +++ 1 1 
JNS Jump on Not Sign 
8-bit Displacement 79+ eee ee eee 1 1 r 
Full Displacement OF 89 +++ 1 1 
JO Jump on Overflow 
8-bit Displacement 70+ - oe eee eee 1 1 r 
Full Displacement OF 80 +++ 1 1 
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Table 9-27 Processor Core Instruction Set Summary (cont.) 


Real | Prot’d | Real | Prot’d 
Flags Mode | Mode | Mode | Mode 
ooD T S Z A P C} Clock Count 
Instruction Opcode Paik F F F F F F |(Reg/Cache Hit) Notes 
JP/JPE Jump on Parity/Parity Even 
8-bit Displacement 7A+ “8 - ee 1 1 r 
Full Displacement OF 8A +++ 1 1 
JS Jump on Sign 
8-bit Displacement 78+ - - ye eee 1 1 r 
Full Displacement OF 88 +++ 1 1 
LAHF Load AH with Flags OF - - - oe ee ee 2 2 
LAR Load Access Rights 
From Register/Memory OF 02 [mod reg r/m] - a g,h,j,p 
LDS Load Pointer to DS C5 [mod reg r/m] - - - oe ee ee 4 9 b hi,j 
LEA Load Effective Address 
No Index Register 8D [mod reg r/m] - - - oe ee ee 1 1 
With Index Register 1 1 
LES Load Pointer to ES C4 [mod reg r/m] - - ye ee ee 4 9 b hi,j 
LFS Load Pointer to FS OF B4 [mod reg r/m] - - - oe ee ee 4 9 b hij 
LGDT Load GDT Register OF 01 [mod 010 r/m] - - yee eee 10 10 b,c hl 
LGS Load Pointer to GS OF B5 [mod reg r/m] - - yee eee 4 9 b hij 
LIDT Load IDT Register OF 01 [mod 011 r/m] - - ye ee ee 10 10 b,c hl 
LLDT Load LDT Register 
From Register/Memory OF 00 [mod 010 r/m] - yee eee 8 a g,h,j,| 
LMSW Load Machine Status Word 
From Register/Memory OF 01 [mod 110 r/m] soe S) gURA WES ie sea The 11 11 b,c hl 
LODS Load String A [110 w] - - - oe oe eee 3 3 b h 
LSL Load Segment Limit 
From Register/Memory OF 03 [mod reg r/m] - - a g,h,j,p 
LSS Load Pointer to SS OF B2 [mod reg r/m] - - ye ee ee 4 10 a hij 
LTR Load Task Register 
From Register/Memory OF 00 [mod 011 r/m] - - oe ee ee 9 a g,h,j,| 
LEAVE Leave Current Stack Frame cg - - - oe oe eee 4 4 b h 
LOOP Offset Loop/No Loop E2+ - - - oe oe eee 2 2 r 
LOOPNZ/LOOPNE Offset EO + - - - oe ee ee 2 r 
LOOPZ/LOOPE Offset E1+ - - - oe ee ee 2 2 r 
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Table 9-27 Processor Core Instruction Set Summary (cont.) 


Real | Prot’d | Real | Prot’d 
Flags Mode | Mode | Mode | Mode 
ODI T S$ Z A P C} Clock Count 
Instruction Opcode F F F F F F F F F |(Reg/Cache Hit) Notes 
MOV Move Data 
Register to Register 8 [10dw] [11 reg r/m] 1 1 b hi,j 
Register to Memory 8 [100w] [mod reg r/m] 1 1 
Register/Memory to Register 8 [101w] [mod reg r/m] 1 1 
Immediate to Register/Memory C [011w] [mod 000 r/m] ### 1 1 
Immediate to Register (short form) B [w reg] ### 1 1 
Memory to Accumulator (short form) A [000w] +++ 1 1 
Accumulator to Memory (short form) A [001 w] +++ 1 1 
Register/Memory to Segment Register 8E [mod sreg3 r/m] 1 6 
Segment Register to Register/Memory 8C [mod sreg3 r/m] 1 1 
MOV Move to/from Control/Debug/Test Regs 
Register to CRO/CR2/CR3/CR4 OF 22 [11 eee reg] - oe 2 eee = = = | 20/5/5 | 18/5/6 I 
CRO/CR2/CR3/CR4 to Register OF 20 [11 eee reg] 6 6 
Register to DRO-DR3 OF 23 [11 eee reg] 10 10 
DRO-DR3 to Register OF 21 [11 eee reg] 9 9 
Register to DR6-DR7 OF 23 [11 eee reg] 10 10 
DR6-DR7 to Register OF 21 [11 eee reg] 9 9 
Register to TR3-5 OF 26 [11 eee reg] 16 16 
TR3-5 to Register OF 24 [11 eee reg] 8 8 
Register to TR6-TR7 OF 26 [11 eee reg] 11 11 
TR6-TR7 to Register OF 24 [11 eee reg] 3 3 
MOVS Move String A [010w] ee ee ee eee 6 6 b h 
MOVSX Move with Sign Extension 
Register from Register/Memory OF B[111w] [mod reg r/m] mal Ss Sai veh Ge eo 1 1 b h 
MOVZX Move with Zero Extension 
Register from Register/Memory OF B[011w] [mod reg r/m] Se 1 1 b h 
MUL Unsigned Multiply 
Accumulator with Register/Memory F [011w] [mod 100 r/m] X - - = X X UU X b h 
Multiplier: Byte 4 4 
Word a o 
Doubleword 15 15 
NEG Negate Integer F [011w] [mod 011 r/m] X= = = X¥ KX KX xX 1 1 b h 
NOP No Operation 90 1 1 
NOT Boolean Complement F [011w] [mod 010 r/m] ee 1 1 b h 
OIO Official Invalid Opcode OF FF - - x O- - - - - 1 8-125 
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Table 9-27 Processor Core Instruction Set Summary (cont.) 


Real | Prot’d | Real | Prot’d 
Flags Mode | Mode | Mode | Mode 
ooD TSZA Clock Count 
Instruction Opcode F F F F F F (Reg/Cache Hit) Notes 

OR Boolean OR 

Register to Register 0 [10dw] [11 reg r/m] 0 - - xX X U 1 1 b h 

Register to Memory 0 [100w] [mod reg r/m] 1 1 

Memory to Register 0 [101w] [mod reg r/m] 1 1 

Immediate to Register/Memory 8 [O0sw] [mod 001 r/m] ### 1 1 

Immediate to Accumulator 0 [110w] ### 1 1 
OUT Output to Port 

Fixed Port E [011w] # - - - oe ee 14 14/28 m 

Variable Port E [111w] 14 14/28 
OUTS Output String 6 [111w] - - - ose 15 15/29 b h,m 
POP Pop Value off Stack 

Register/Memory 8F [mod 000 r/m] - - oe ee 1/4 1/4 b hi,j 

Register (short form) 5 [1 reg] 1 1 

Segment Register (ES, SS, DS) [000 sreg2 111] 1 6 

Segment Register (FS, GS) OF [10 sreg3 001] 1 6 
POPA Pop All General Registers 61 < + ~ ems 9 9 b h 
POPF Pop Stack into FLAGS 9D x xX x X X X 8 8 b h,n 
PREFIX BYTES 

Assert Hardware LOCK Prefix FO - - - eee m 

Address Size Prefix 67 

Operand Size Prefix 66 

Segment Override Prefix 

-CS 2E 

-DS 3E 

-ES 26 

-FS 64 

-GS 65 

-SS 36 
PUSH Push Value onto Stack 

Register/Memory FF [mod 110 r/m] = + ee 1/3 1/3 b h 

Register (short form) 5 [0 reg] 1 1 

Segment Register (ES, CS, SS, DS) [000 sreg2 110] 1 1 

Segment Register (FS, GS) OF [10 sreg3 000] 1 1 

Immediate 6 [10s0] ### 1 1 
PUSHA Push All General Registers 60 - - - oe oe 11 11 b h 
PUSHF Push FLAGS Register 9C - - - oe ee 2 2 b h 
RCL Rotate Through Carry Left 

Register/Memory by 1 D [000w] [mod 010 r/m] xX - - oe ee 3 3 b h 

Register/Memory by CL D [001w] [mod 010 r/m] u- - oe ee 8 8 

Register/Memory by Immediate C [000w] [mod 010 r/m] # u- - oe ee 8 8 
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Table 9-27 Processor Core Instruction Set Summary (cont.) 


Real | Prot’d | Real | Prot’d 
Flags Mode | Mode | Mode | Mode 
ODI T A Clock Count 
Instruction Opcode F F F F F (Reg/Cache Hit) Notes 

RCR Rotate Through Carry Right 

Register/Memory by 1 D [000w] [mod 011 r/m] X - - + - 4 4 b h 

Register/Memory by CL D [001w] [mod 011 r/m] u- - - - 8 8 

Register/Memory by Immediate C [000w] [mod 011 r/m] # u- - - - 8 8 
RDMSR Read Tmodel Specific Register OF 32 - oe ee - 
RDTSC Read Time Stamp Counter OF 31 - oe ee - 
REP INS /nput String F3 6[110w] - eee - 17+4n | 17+4n\ b h,m 

32+4n 
REP LODS Load String F3 A[110w] - os ee - 9+2n | 9+2n b h 
REP MOVS Move String F3 A[010w] - eee - 12+2n | 12+2n b h 
REP OUTS Output String F3 6[111w] - oe ee - 244+4n | 24+4n\ b h,m 
39+4n 

REP STOS Siore String F3 A[101w] - foe - 9+2n 9+2n b h 
REPE CMPS Compare String 

Find non-match F3 A[011w] , a x 11+4n | 11+4n b h 
REPE SCAS Scan String 

Find non-AL/AX/EAX F3 A[111w] Xx 9+3n | 9+3n b h 
REPNE CMPS Compare String 

Find match F2 A[011w] Xx 11+4n | 11+4n b h 
REPNE SCAS Scan String 

Find AL/AX/EAX F2 A[111w] a Xx 9+3n | 9+3n b h 
RET Return from Subroutine 

Within Segment C3 - + + - 3 3 b g,h,j.k,r 

Within Segment Adding Immediate to SP C2 ## 3 3 

Intersegment CB 10 13 

Intersegment Adding Immediate to SP CA ## 10 13 

Protected Mode: Different Privilege Level 

-Intersegment 35 

-Intersegment Adding Immediate to SP 35 
ROL Rotate Left 

Register/Memory by 1 D[000w] [mod 000 r/m X - - + - b h 

Register/Memory by CL D[001w] [mod 000 r/m Us = - - 

Register/Memory by Immediate C[000w] [mod 000 r/m] # Us = = ? 
ROR Rotate Right 

Register/Memory by 1 D[000w] [mod 001 r/m X - 2 - - b h 

Register/Memory by CL D[001w] [mod 001 r/m Un = - - 

Register/Memory by Immediate C[000w] [mod 001 r/m] # u- - - - 
RSDC Restore Segment Register and Descriptor |OF 79 [mod sreg3 r/m] - oe ee - 11 11 s Ss 
RSLDT Restore LDTR and Descriptor OF 7B [mod 000 r/m] af eT - 11 11 s s 
RSTS Restore TSR and Descriptor OF 7D [mod 000 r/m] - oe oe - 11 11 s Ss 
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Table 9-27 Processor Core Instruction Set Summary (cont.) 


Real | Prot’d | Real | Prot’d 
Flags Mode | Mode | Mode | Mode 
ooD TSZA Clock Count 
Instruction Opcode F F F F F F (Reg/Cache Hit) Notes 

RSM Resume from SMM Mode OF AA x xX x xX X X 57 57 s s 
SAHF Store AH in FLAGS 9E - - et I 1 1 
SAL Shift Left Arithmetic 

Register/Memory by 1 D[000w] [mod 100 r/m] Xx - - xX X U 1 1 b h 

Register/Memory by CL D[001w] [mod 100 r/m] u - - xX XU 2 2 

Register/Memory by Immediate C[O00w] [mod 100 r/m] # u - - xX X U 1 1 
SAR Shift Right Arithmetic 

Register/Memory by 1 D[000w] [mod 111 r/m] x - - xX X U b h 

Register/Memory by CL D[001w] [mod 111 r/m] u - ~ xX xX U 

Register/Memory by Immediate C[O000w] [mod 111 r/m] # u = ~ xX xX wu 
SBB /nteger Subtract with Borrow 

Register to Register 1[10dw] [11 reg r/m] x + - X X X 1 1 b h 

Register to Memory 1[100w] [mod reg r/m] 1 1 

Memory to Register 1[101w] [mod reg r/m] 1 1 

Immediate to Register/Memory 8[00sw] [mod 011 r/m] ### 1 1 

Immediate to Accumulator (short form) 1[110w] ### 1 1 
SCAS Scan String A [111w] x - - xX X XxX 2 2 b h 
SETB/SETNAE/SETC Set Byte on Below/Not Above or Equal/Carry 

To Register/Memory OF 92 [mod 000 r/m] - - of ee 1 1 h 
SETBE/SETNA Set Byte on Below or Equal/Not Above 

To Register/Memory OF 96 [mod 000 r/m] - - - oe ee 1 1 h 
SETE/SETZ Set Byte on Equal/Zero 

To Register/Memory OF 94 [mod 000 r/m] - - - oe ee 1 1 h 
SETL/SETNGE Set Byte on Less/Not Greater or Equal 

To Register/Memory OF 9C [mod 000 r/m] - - - oe ee 1 1 h 
SETLE/SETNG Set Byte on Less or Equal/Not Greater 

To Register/Memory OF 9E [mod 000 r/m - - oe ee 1 1 h 
SETNB/SETAE/SETNC Set Byte on Not Below/Above or Equal/Not Carry 

To Register/Memory OF 93 [mod 000 r/m - - - oe ee 1 1 h 
SETNBE/SETA Set Byte on Not Below or Equal/Above 

To Register/Memory OF 97 [mod 000 r/m - - oe ee 1 1 h 
SETNE/SETNZ Set Byte on Not Equal/Not Zero 

To Register/Memory OF 95 [mod 000 r/m - - - of ee 1 1 h 
SETNL/SETGE Set Byte on Not Less/Greater or Equal 

To Register/Memory OF 9D [mod 000 r/m] - - - oe oe 1 1 h 
SETNLE/SETG Set Byte on Not Less or Equal/Greater 

To Register/Memory OF 9F [mod 000 r/m] - - - oe oe 1 1 h 
SETNO Set Byte on Not Overflow 

To Register/Memory OF 91 [mod 000 r/m] - - - ose 1 1 h 
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Table 9-27 Processor Core Instruction Set Summary (cont.) 


Real | Prot’d | Real | Prot’d 
Flags Mode | Mode | Mode | Mode 
ODI T S$ Z A P Cj} Clock Count 
Instruction Opcode F F F F F F F F F |(Reg/Cache Hit) Notes 

SETNP/SETPO Set Byte on Not Parity/Parity Odd 

To Register/Memory OF 9B [mod 000 r/m 1 1 h 
SETNS Set Byte on Not Sign 

To Register/Memory OF 99 [mod 000 r/m ee 1 1 h 
SETO Set Byte on Overflow 

To Register/Memory OF 90 [mod 000 r/m 1 1 h 
SETP/SETPE Set Byte on Parity/Parity Even 

To Register/Memory OF 9A [mod 000 r/m ee 1 1 h 
SETS Set Byte on Sign 

To Register/Memory OF 98 [mod 000 r/m ee 1 1 h 
SGDT Store GDT Register 

To Register/Memory OF 01 [mod 000 r/m ee 6 6 b,c h 
SIDT Store IDT Register 

To Register/Memory OF 01 [mod 001 r/m 6 6 b,c h 
SLDT Store LDT Register 

To Register/Memory OF 00 [mod 000 r/m a 1 a h 
STR Store Task Register 

To Register/Memory OF 00 [mod 001 r/m 3 a h 
SMSW Store Machine Status Word OF 01 [mod 100 r/m 4 4 b,c h 
STOS Store String A [101w] - eee 2 2 b h 
SHL Shift Left Logical 

Register/Memory by 1 D [000w] [mod 100 r/m] X - - - X X UX xX 1 1 b h 

Register/Memory by CL D [001w] [mod 100 r/m] u- - - X xX UX X 2 2 

Register/Memory by Immediate C [000w] [mod 100 r/m] # u- - - X X UX x 1 1 
SHLD Shift Left Double 

Register/Memory by Immediate OF A4 [mod reg r/m] # uy =» ®& X Ux xX 3 3 b h 

Register/Memory by CL OF A5 [mod reg r/m] 
SHR Shift Right Logical 

Register/Memory by 1 D [000w] [mod 101 r/m] Xu = & ke R uk xX 2 2 b h 

Register/Memory by CL D [001w] [mod 101 r/m] u- - - X xX UX xX 

Register/Memory by Immediate C [000w] [mod 101 r/m] # u- - - X X UX Xx 2 2 
SHRD Shift Right Double 

Register/Memory by Immediate OF AC [mod reg r/m] # u- - - X X UX XxX 3 3 b h 

Register/Memory by CL OF AD [mod reg r/m] 6 6 
SMINT Software SMM Entry OF 38 84 84 s s 
STC Set Carry Flag FQ ee | 1 1 
STD Set Direction Flag FD - 1 - - - - ee 
STI Set Interrupt Flag FB Se 6 6 m 
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Table 9-27 Processor Core Instruction Set Summary (cont.) 


Processor Core Instruction Set 


Real | Prot’d | Real | Prot’d 
Flags Mode | Mode | Mode | Mode 
: D TS A Clock Count 
Instruction Opcode F F F F (Reg/Cache Hit) Notes 
SUB /nteger Subtract 
Register to Register 2 [10dw] [11 reg r/m] - - xX x 1 1 b h 
Register to Memory 2 [100w] [mod reg r/m] 1 1 
Memory to Register 2 [101w] [mod reg r/m] 1 1 
Immediate to Register/Memory 8 [O0sw] [mod 101 r/m] ### 1 1 
Immediate to Accumulator (short form) 2 [110w] ### 1 1 
SVDC Save Segment Register and Descriptor OF 78 [mod sreg3 r/m] - - - - 20 20 s Ss 
SVLDT Save LDTR and Descriptor OF 7A [mod 000 r/m] - - - 20 20 s Ss 
SVTS Save TSR and Descriptor OF 7C [mod 000 r/m] - - - - 21 21 s Ss 
TEST Test Bits 
Register/Memory and Register 8 [010w] [mod reg r/m] - - xX u 1 1 b h 
Immediate Data and Register/Memory F [011w] [mod 000 r/m] ### 1 1 
Immediate Data and Accumulator A [100w] ### 1 1 
VERR Verify Read Access 
To Register/Memory OF 00 [mod 100 r/m] - - - - 8 a g,h,j,p 
VERW Verify Write Access 
To Register/Memory OF 00 [mod 101 r/m] - - - 8 a g,h,j,P 
WAIT Wait Until FPU Not Busy 9B - - + - 1 1 
WBINVD Write-Back and Invalidate Cache OF 09 - - - - 23 23 t t 
WRMSR Write to Model Specific Register OF 30 = = = 
XADD Exchange and Add 
Register1, Register2 OF C[000w] [11 reg2 reg] - - xX x 2 2 
Memory, Register OF C[000w] [mod reg r/m] 2 2 
XCHG Exchange 
Register/Memory with Register 8[011w] [mod reg r/m] - - - 2 2 b,f f,h 
Register with Accumulator 9[0 reg] 2 2 
XLAT Translate Byte D7 - - - - 5 5 h 
XOR Boolean Exclusive OR 
Register to Register 3 [00dw] [11 reg r/m] - - xX U 1 1 b h 
Register to Memory 3 [000w] [mod reg r/m] 1 1 
Memory to Register 3 [001w] [mod reg r/m] 1 1 
Immediate to Register/Memory 8 [O0sw] [mod 110 r/m] ### 1 1 
Immediate to Accumulator (short form) 3 [010w] ### 1 1 
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Instruction Notes for Instruction Set Summary 

Notes a through c apply to Real Address Mode only: 

a. This is a Protected Mode instruction. Attempted execution in 
Real Mode will result in exception 6 (invalid opcode). 


b. Exception 13 fault (general protection) will occur in Real 
Mode if an operand reference is made that partially or fully 
extends beyond the maximum CS, DS, ES, FS, or GS seg- 
ment limit (FFFFH). Exception 12 fault (stack segment limit 
violation or not present) will occur in Real Mode if an operand 
reference is made that partially or fully extends beyond the 
maximum SS limit. 


c. This instruction may be executed in Real Mode. In Real 
Mode, its purpose is primarily to initialize the CPU for Pro- 
tected Mode. 


d. - 


Notes e through g apply to Real Address Mode and Protect- 

ed Virtual Address Mode: 

e. An exception may occur, depending on the value of the 
operand. 


f. LOCK# is automatically asserted, regardless of the pres- 
ence or absence of the LOCK prefix. 


g. LOCK# is asserted during descriptor table accesses. 


Notes h through r apply to Protected Virtual Address Mode 

only: 

h. Exception 13 fault will occur if the memory operand in CS, 
DS, ES, FS, or GS cannot be used due to either a segment 
limit violation or an access rights violation. If a stack limit is 
violated, an exception 12 occurs. 


i. For segment load operations, the CPL, RPL, and DPL must 
agree with the privilege rules to avoid an exception 13 fault. 
The segment’s descriptor must indicate “present” or excep- 
tion 11 (CS, DS, ES, FS, GS not present). If the SS register 
is loaded and a stack segment not present is detected, an 
exception 12 occurs. 


j. All segment descriptor accesses in the GDT or LDT made by 
this instruction will automatically assert LOCK# to maintain 
descriptor integrity in multiprocessor systems. 


k. JMP, CALL, INT, RET, and IRET instructions referring to 
another code segment will cause an exception 13, if an 
applicable privilege rule is violated. 


I. An exception 13 fault occurs if CPL is greater than 0 (0 is the 
most privileged level). 


m. An exception 13 fault occurs if CPL is greater than IOPL. 


n. The IF bit of the flag register is not updated if CPL is greater 
than IOPL. The IOPL and VM fields of the flag register are 
updated only if CPL = 0. 


o. The PE bit of the MSW (CRO) cannot be reset by this 
instruction. Use MOV into CRO if desiring to reset the PE bit. 


p. Any violation of privilege rules as apply to the selector oper- 
and does not cause a Protection exception, rather, the zero 
flag is cleared. 


q. If the coprocessor’s memory operand violates a segment 
limit or segment access rights, an exception 13 fault will 
occur before the ESC instruction is executed. An exception 
12 fault will occur if the stack limit is violated by the oper- 
and’s starting address. 


r. The destination of a JMP, CALL, INT, RET, or IRET must be 
in the defined limit of a code segment or an exception 13 
fault will occur. 


Note s applies to Cyrix-specific SMM instructions: 

s. All memory accesses to SMM space are non-cacheable. An 
invalid opcode exception 6 occurs unless SMI is enabled 
and SMAR size > 0, and CPL = 0 and [SMAC is set or if in 
an SMI handler]. 


Note t applies to cache invalidation instruction with the 

cache operating in write-back mode: 

t. The total clock count is the clock count shown plus the num- 
ber of clocks required to write all “modified” cache lines to 
external memory. 
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9.4 FPU Instruction Set 


The processor core is functionally divided into the 
FPU, and the integer unit. The FPU processes 
floating point instructions only and does so in 
parallel with the integer unit. 


For example, when the integer unit detects a 
floating point instruction without memory operands, 
after two clock cycles the instruction passes to the 
FPU for execution. The integer unit continues to 
execute instructions while the FPU executes the 
floating point instruction. If another FPU instruction 
is encountered, the second FPU instruction is 
placed in the FPU queue. Up to four FPU instruc- 
tions can be queued. In the event of an FPU 
exception, while other FPU instructions are 
queued, the state of the CPU is saved to ensure 
recovery. 


The instruction set for the FPU is summarized in 
Table 9-29. The table uses abbreviations that are 
described Table 9-28. 


a. FPU Instruction Set 


Table 9-28 FPU Instruction Set Table Legend 


Abbr. Description 
n Stack register number 
TOS Top of stack register pointed to by SSS in 
the status register. 
ST(1) FPU register next to TOS 
ST(n) A specific FPU register, relative to TOS 
M.WI 16-bit integer operand from memory 
M.SI 32-bit integer operand from memory 
M.LI 64-bit integer operand from memory 
M.SR 32-bit real operand from memory 
M.DR 64-bit real operand from memory 
M.XR 80-bit real operand from memory 
M.BCD 18-digit BCD integer operand from memory 
CC FPU condition code 
Env Regs Status, Mode Control and Tag Registers, 


Instruction Pointer and Operand Pointer 
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Table 9-29 FPU Instruction Set Summary 


FPU Instruction Opcode Operation cain Notes 
F2XM1 Function Evaluation 2°-1 D9 FO TOS <--- 2708-4 92 - 108 2 
FABS Floating Absolute Value D9 E1 TOS <--- | TOS | 2 2 
FADD Floating Point Add 
Top of Stack DC [1100 0 nj ST(n) <--- ST(n) + TOS 4-9 
80-bit Register D8 [1100 0 nj] TOS <--- TOS + ST(n) 4-9 
64-bit Real DC [mod 000 r/m] TOS <--- TOS + M.DR 4-9 
32-bit Real D8 [mod 000 r/m] TOS <--- TOS + M.SR 4-9 
FADDP Floating Point Add, Pop DE [1100 0 n] ST(n) <--- ST(n) + TOS; then pop TOS 
FIADD Floating Point Integer Add 
32-bit integer DA [mod 000 r/m] TOS <--- TOS + M.SI 8-14 
16-bit integer DE [mod 000 r/m] TOS <--- TOS + M.WI 8-14 
FCHS Floating Change Sign D9 EO TOS <--- - TOS 2 
FCLEX Clear Exceptions (9B) DB E2 Wait then Clear Exceptions 
FNCLEX Clear Exceptions DB E2 Clear Exceptions 
FCMOVB Floating Point Conditional Move if DA [1100 0 nj] If (CF=1) ST(0) <--- ST(n) 4 
Below 
FCMOVE Floating Point Conditional Move if DA [1100 1 n] If (ZF=1) ST(0) <--- ST(n) 4 
Equal 
FCMOVBE Floating Point Conditional Move if DA [1101 0 n] If (CF=1 or ZF=1) ST(0) <--- ST(n) 4 
Below or Equal 
FCMOVU Floating Point Conditional Move if DA [1101 1 n] If (PF=1) ST(0) <--- ST(n) 4 
Unordered 
FCMOVNB Floating Point Conditional Move if DB [1100 0 nj] If (CF=0) ST(0) <--- ST(n) 4 
Not Below 
FCMOVNE Floating Point Conditional Move if DB [1100 1 n] If (ZF=0) ST(0) <--- ST(n) 4 
Not Equal 
FCMOVNBE Floating Point Conditional Move if | DB [1101 0 nj] If (CF=0 and ZF=0) ST(0) <--- ST(n) 4 
Not Below or Equal 
FCMOVNU Floating Point Conditional Move if DB [1101 1 n] If (DF=0) ST(0) <--- ST(n) 4 
Not Unordered 
FCOM Floating Point Compare 
80-bit Register D8 [1101 0 n] CC set by TOS - ST(n) 
64-bit Real DC [mod 010 r/m] CC set by TOS - M.DR 
32-bit Real D8 [mod 010 r/m] CC set by TOS - M.SR 4 
FCOMP Floating Point Compare, Pop 
80-bit Register D8 [1101 1 n] CC set by TOS - ST(n); then pop TOS 4 
64-bit Real DC [mod 011 r/m] CC set by TOS - M.DR; then pop TOS 4 
32-bit Real D8 [mod 011 r/m] CC set by TOS - M.SR; then pop TOS 4 
FCOMPP Floating Point Compare, Pop DE D9 CC set by TOS - ST(1); then pop TOS and 4 
Two Stack Elements ST(1) 
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Table 9-29 FPU Instruction Set Summary (cont.) 


FPU Instruction Opcode Operation con Motes 

FCOMI Floating Point Compare Real and Set EFLAGS 

80-bit Register DB [1111 0 n] EFLAG set by TOS - ST(n) 4 
FCOMIP Floating Point Compare Real and Set EFLAGS, Pop 

80-bit Register DF [1111 0 nj] EFLAG set by TOS - ST(n); then pop TOS 4 
FUCOMI Floating Point Unordered Compare Real and Set EFLAGS 

80-bit Integer DB [1110 1 n] EFLAG set by TOS - ST(n) 9-10 
FUCOMIP Floating Point Unordered Compare Real and Set EFLAGS, Pop 

80-bit Integer DF [1110 1 n] EFLAG set by TOS - ST(n); then pop TOS 9-10 
FICOM Floating Point Integer Compare 

32-bit integer DA [mod 010 r/m] CC set by TOS - M.WI 9-10 

16-bit integer DE [mod 010 r/m] CC set by TOS - M.SI 9-10 
FICOMP Floating Point Integer Compare, Pop 

32-bit integer DA [mod 011 r/m] CC set by TOS - M.WI; then pop TOS 9-10 

16-bit integer DE [mod 011 r/m CC set by TOS - M.SI; then pop TOS 9-10 
FCOS Function Evaluation: Cos(x) D9.FF TOS <--- COS(TOS) 92 - 141 1 
FDECSTP Decrement Stack pointer D9 F6 Decrement top of stack pointer 4 
FDIV Floating Point Divide 

Top of Stack DC [1111 1 n] ST(n) <--- ST(n) / TOS 24 - 34 

80-bit Register D8 [1111 On] TOS <--- TOS / ST(n) 24 - 34 

64-bit Real DC [mod 110 r/m] TOS <--- TOS / M.DR 24 - 34 

32-bit Real D8 [mod 110 r/m] TOS <--- TOS / M.SR 24 - 34 
FDIVP Floating Point Divide, Pop DE [1111 1 n] ST(n) <--- ST(n) / TOS; then pop TOS 24 - 34 
FDIVR Floating Point Divide Reversed 

Top of Stack DC [1111 0 n] TOS <--- ST(n) / TOS 24 - 34 

80-bit Register D8 [1111 1 n] ST(n) <--- TOS / ST(n) 24 - 34 

64-bit Real DC [mod 111 r/m] TOS <--- M.DR/ TOS 24 - 34 

32-bit Real D8 [mod 111 r/m] TOS <--- M.SR/ TOS 24 - 34 
FDIVRP Floating Point Divide Reversed, Pop DE [1111 0 n] ST(n) <--- TOS / ST(n); then pop TOS 24 - 34 
FIDIV Floating Point Integer Divide 

32-bit Integer DA [mod 110 r/m] TOS <--- TOS / M.SI 34 - 38 

16-bit Integer DE [mod 110 r/m] TOS <--- TOS / M.WI 34 - 38 
FIDIVR Floating Point Integer Divide Reversed 

32-bit Integer DA [mod 111 r/m] TOS <--- M.SI / TOS 34 - 38 

16-bit Integer DE [mod 111 r/m] TOS <--- M.WI/ TOS 34 - 38 
FFREE Free Floating Point Register DD [1100 0 nj] TAG(n) <--- Empty 4 
FINCSTP /ncrement Stack Pointer D9 F7 Increment top-of-stack pointer 2 
FINIT /nitialize FPU (9B)DB E3 Wait, then initialize 8 
FNINIT /nitialize FPU DB E3 Initialize 6 
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Table 9-29 FPU Instruction Set Summary (cont.) 


FPU Instruction Opcode Operation enunt Motes 

FLD Load Data to FPU Register 

Top of Stack D9 [1100 0 nj] Push ST(n) onto stack 

64-bit Real DD [mod 000 r/m] Push M.DR onto stack 

32-bit Real D9 [mod 000 r/m] Push M.SR onto stack 
FBLD Load Packed BCD Data to FPU Register | DF [mod 100 r/m] Push M.BCD onto stack 41-45 
FILD Load Integer Data to FPU Register 

64-bit Integer DF [mod 101 r/m] Push M.LI onto stack 4-8 

32-bit Integer DB [mod 000 r/m] Push M.SI onto stack 4-6 

16-bit Integer DF [mod 000 r/m] Push M.WI onto stack 3-6 
FLD1 Load Floating Const.= 1.0 D9 E8 Push 1.0 onto stack 
FLDCW Load FPU Mode Control Register D9 [mod 101 r/m] Ctl Word <--- Memory 
FLDENV Load FPU Environment D9 [mod 100 r/m] Env Regs <--- Memory 30 
FLDL2E Load Floating Const.= Logo(e) D9 EA Push Logs(e) onto stack 4 
FLDL2T Load Floating Const.= Logo(10) D9 E9 Push Logs(10) onto stack 4 
FLDLG2 Load Floating Const.= Log 10(2) D9 EC Push Logj9(2) onto stack 4 
FLDLN2 Load Floating Const.= Ln(2) D9 ED Push Log,(2) onto stack 4 
FLDPI Load Floating Const.=1 D9 EB Push z onto stack 4 
FLDZ Load Floating Const.= 0.0 D9 EE Push 0.0 onto stack 4 
FMUL Floating Point Multiply 

Top of Stack DC [1100 1 nj ST(n) <--- ST(n) x TOS -9 

80-bit Register D8 [1100 1 n] TOS <--- TOS x ST(n) -9 

64-bit Real DC [mod 001 r/m] TOS <--- TOS x M.DR - 

32-bit Real D8 [mod 001 r/m] TOS <--- TOS x M.SR -6 
FMULP Floating Point Multiply & Pop DE [1100 1 n] ST(n) <--- ST(n) x TOS; then pop TOS 4-9 
FIMUL Floating Point Integer Multiply 

32-bit Integer DA [mod 001 r/m] TOS <--- TOS x M.SI 9-11 

16-bit Integer DE [mod 001 r/m] TOS <--- TOS x M.WI 8-10 
FNOP No Operation D9 DO No Operation 2 
FPATAN Function Eval: Tan’! (y/x) D9 F3 ST(1) <--- ATAN[ST(1) / TOS]; then pop TOS 97 - 161 3 
FPREM Floating Point Remainder D9 F8 TOS <--- Rem[TOS / ST(1)] 82-91 
FPREM1 Floating Point Remainder IEEE D9 F5 TOS <--- Rem[TOS / ST(1)] 82-91 
FPTAN Function Eval: Tan(x) D9 F2 TOS <--- TAN(TOS); then push 1.0 onto stack | 117 - 129 1 
FRNDINT Round to Integer D9 FC TOS <--- Round(TOS) 10-20 
FRSTOR Load FPU Environment and Register DD [mod 100 r/m] Restore state 56 - 72 
FSAVE Save FPU Environment and Register (9B)DD [mod 110 r/m] Wait, then save state 57 - 67 
FNSAVE Save FPU Environment and Register DD [mod 110 r/m] Save state 55 - 65 
FSCALE Floating Multiply by 2” D9 FD TOS <--- TOS x 2811) 7-14 
FSIN Function Evaluation: Sin(x) D9.FE TOS <--- SIN(TOS) 76 - 140 1 
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Table 9-29 FPU Instruction Set Summary (cont.) 


FPU Instruction Set 


FPU Instruction Opcode Operation con Notre 
FSINCOS Function Eval.: Sin(x)& Cos(x) D9 FB temp <--- TOS; 145 - 161 1 
TOS <--- SIN(temp); then 
push COS(temp) onto stack 

FSQRT Floating Point Square Root D9 FA TOS <--- Square Root of TOS 59 - 60 
FST Store FPU Register 

Top of Stack DD [1101 0 nj ST(n) <--- TOS 

64-bit Real DD [mod 010 r/m] M.DR <--- TOS 

32-bit Real D9 [mod 010 r/m] M.SR <--- TOS 
FSTP Store FPU Register, Pop 

Top of Stack DB [1101 1 n] ST(n) <--- TOS; then pop TOS 2 

80-bit Real DB [mod 111 r/m] M.XR <--- TOS; then pop TOS 2 

64-bit Real DD [mod 011 r/m] M.DR <--- TOS; then pop TOS 2 

32-bit Real D9 [mod 011 r/m] M.SR <--- TOS; then pop TOS 2 
FBSTP Store BCD Data, Pop DF [mod 110 r/m] M.BCD <--- TOS; then pop TOS 57 - 63 
FIST Store Integer FPU Register 

32-bit Integer DB [mod 010 r/m] M.SI <--- TOS 8-13 

16-bit Integer DF [mod 010 r/m] M.WI <--- TOS - 10 
FISTP Store Integer FPU Register, Pop 

64-bit Integer DF [mod 111 r/m] M.LI <--- TOS; then pop TOS 10-13 

32-bit Integer DB [mod 011 r/m] M.SI <--- TOS; then pop TOS 8-13 

16-bit Integer DF [mod 011 r/m] M.WI <--- TOS; then pop TOS 7-10 
FSTCW Store FPU Mode Control Register 9B)D9 [mod 111 r/m] Wait Memory <--- Control Mode Register 5 
FNSTCW Store FPU Mode Control Register D9 [mod 111 r/m] Memory <--- Control Mode Register 3 
FSTENV Store FPU Environment 9B)D9 [mod 110 r/m] Wait Memory <--- Env. Registers 14-24 
FNSTENV Store FPU Environment D9 [mod 110 r/m] Memory <--- Env. Registers 12 +22 
FSTSW Store FPU Status Register 9B)DD [mod 111 r/m] Wait Memory <--- Status Register 6 
FNSTSW Store FPU Status Register DD [mod 111 r/m] Memory <--- Status Register 4 
FSTSW AX Store FPU Status Register to AX 9B)DF EO Wait AX <--- Status Register 4 
FNSTSW AX Store FPU Status Register to AX DF EO AX <--- Status Register 2 
FSUB Floating Point Subtract 

Top of Stack DC [1110 1 n] ST(n) <--- ST(n) - TOS -9 

80-bit Register D8 [1110 0 n] TOS <--- TOS - ST(n - 

64-bit Real DC [mod 100 r/m] TOS <--- TOS - M.DR 4-9 

32-bit Real D8 [mod 100 r/m] TOS <--- TOS - M.SR 4-9 
FSUBP Floating Point Subtract, Pop DE [1110 1 nj] ST(n) <--- ST(n) - TOS; then pop TOS 4-9 
FSUBR Floating Point Subtract Reverse 

Top of Stack DC [1110 0 n] TOS <--- ST(n) - TOS -9 

80-bit Register D8 [11101 n] ST(n) <--- TOS - ST(n) -9 

64-bit Real DC [mod 101 r/m] TOS <--- M.DR - TOS “9 

32-bit Real D8 [mod 101 r/m] TOS <--- M.SR - TOS - 
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Table 9-29 FPU Instruction Set Summary (cont.) 


FPU Instruction Opcode Operation Count Hotes 

FSUBRP Floating Point Subtract Reverse, Pop | DE [11100 n] ST(n) <--- TOS - ST(n); then pop TOS 4-9 
FISUB Floating Point Integer Subtract 

32-bit Integer DA [mod 100 r/m] TOS <--- TOS - M.SI 14-29 

16-bit Integer DE [mod 100 r/m] TOS <--- TOS - M.WI 14-27 
FISUBR Floating Point Integer Subtract Reverse 

32-bit Integer Reversed DA [mod 101 r/m] TOS <--- M.SI - TOS 14-29 

16-bit Integer Reversed DE [mod 101 r/m] TOS <--- M.WI - TOS 14-27 
FTST Test Top of Stack D9 E4 CC set by TOS - 0.0 4 
FUCOM Unordered Compare DD [1110 0 nj] CC set by TOS - ST(n) 4 
FUCOMP Unordered Compare, Pop DD [1110 1 n] CC set by TOS - ST(n); then pop TOS 4 
FUCOMPP Unordered Compare, Pop two DA EQ CC set by TOS - ST(I); then pop TOS and 4 

elements ST(1) 
FWAIT Wait 9B Wait for FPU not busy 2 
FXAM Report Class of Operand D9 E5 CC <--- Class of TOS 4 
FXCH Exchange Register with TOS D9 [1100 1 n] TOS <--> ST(n) Exchange 3 
FXTRACT Extract Exponent D9 F4 temp <--- TOS; 11-16 

TOS <--- exponent (temp); then 
push significant (temp) onto stack 
FLY2X Function Eval. y x Log2(x) D9 F1 ST(1) <--- ST(1) x Logo(TOS); then pop TOS 145 - 154 
FLY2XP1 Function Eval. y x Log2(x+1) D9 F9 ay <--- ST(1) x Logo(1+TOS); then pop 131 - 133 4 
FPU Instruction Summary Notes 2. For F2XM1, clock count is 92 if absolute value of TOS < 0.5. 
All references to TOS and ST(n) refer to stack layout prior to 3. For FPATAN, clock count is 97 if ST(1)/TOS < 7/32. 


execution. 

Values popped off the stack are discarded. 

A pop from the stack increments the top of stack pointer. 
A push to the stack decrements the top of stack pointer. 


4. For FYL2XP1, clock count is 170 if TOS is out of range and 
regular FYL2X is called. 


5. The following opcodes are reserved by Cyrix: 
D9D7, D9E2, D9E7, DDFC, DED8, DEDA, DEDC, DEDD, 


Notes: DEDE, DFFC. 

1. For FCOS, FSIN, FSINCOS and FPTAN, time shown is for if . F | | 
absolute value of TOS < 3p/4. Add 90 clock counts for argu- a esoned Spode ie ekeciied sand Unbbediciable tesulls 
ment reduction if outside this range. may occur (exceptions are not generated). 


For FCOS, clock count is 141 if TOS < 2/4 and clock count is 
92 if 1/4 < TOS > 2/2. 


For FSIN, clock count is 81 to 82 if absolute value of TOS < 
m/4. 
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9.5 MMX™ Instruction Set 


The CPU is functionally divided into the FPU unit, 

and the integer unit. The FPU has been extended 

to processes both MMX™ instructions and floating 
point instructions in parallel with the integer unit. 


For example, when the integer unit detects a MMX 
instruction, the instruction passes to the FPU unit 
for execution. The integer unit continues to execute 
instructions while the FPU unit executes the MMX 
instruction. If another MMX instruction is encoun- 
tered, the second MMxX instruction is placed in the 
MMX queue. Up to four MMX instructions can be 
queued. 


MMxX instruction set is summarized in Table 9-31. 
The abbreviations used in the table are listed Table 
9-30. 


Table 9-30 MMX Instruction Set Table Legend 


Abbreviation Description 

<---- Result written 

[11 mm reg] Binary or binary groups of digits 

mm One of eight 64-bit MMX registers 

reg A general purpose register 

<--sat-- If required, the resultant data is saturated 
to remain in the associated data range 

<--move-- Source data is moved to result location 

[byte] Eight 8-bit bytes are processed in parallel 

[word] Four 16-bit word are processed in parallel 

[dword] Two 32-bit double words are processed in 
parallel 

[qword] One 64-bit quad word is processed 

[Sign xxx] The byte, word, double word or quad 
word most significant bit is a sign bit 

mm1, mm2 MMxX Register 1, MMX Register 2 

mod r/m Mod and r/m byte encoding (page 6-6 of 
this manual) 

pack Source data is truncated or saturated to 
next smaller data size, then concate- 
nated. 

packdw Pack two double words from source and 
two double words from destination into 
four words in destination register. 

packwb Pack four words from source and four 
words from destination into eight bytes in 
destination register. 
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Table 9-31 MMX Instruction Set Summary 


MMX Instructions 


Opcode 


Operation and Clock Count (Latency/Throughput) 


EMMS Empty MMX State 


OF77 


Tag Word <--- FFFFh (empties the floating point tag word) 


1 


MOVD Move Doubleword 


Register to MMX Register 


OF6E [11 mm reg] 


MMX reg [qword] <--move, zero extend-- reg [dword] 


va) 


MMxX Register to Register OF7E [11 mm reg] reg [qword] <--move-- MMX reg [low dword] 5/1 

Memory to MMX Register OF6E [mod mm r/m]_|MMX regr[qword] <--move, zero extend-- memory[dword] 1/1 

MMxX Register to Memory OF7E [mod mm r/m] |Memory [dword] <--move-- MMX reg [low dword] val 
MOVQ Move Quardword 

MMxX Register 2 to MMX Register 1 | OF6F [11 mm1 mm2] | MMX reg 1 [qword] <--move-- MMX reg 2 [qword] v1 

MMxX Register 1 to MMX Register 2. | OF7F [11 mm1 mm2] | MMX reg 2 [qword] <--move-- MMX reg 1 [qword] val 

Memory to MMX Register OF6F [mod mm r/m]_ |MMX reg [qword] <--move-- memory[qword] val 

MMxX Register to Memory OF7F [mod mm r/m] |Memory [qword] <--move-- MMX reg [qword] 1 
PACKSSDW Pack Dword with Signed Saturation 

MMxX Register 2 to MMX Register 1 |OF6B [11 mm1 mm2] | MMX reg 1 [qword] <--packdw, signed sat-- MMX reg 2, MMX reg 1 1 

Memory to MMX Register OF6B [mod mm r/m]_ |MMX reg [qword] <--packdw, signed sat-- memory, MMX reg 1 
PACKSSWB Pack Word with Signed Saturation 

MMX Register 2 to MMX Register 1 | OF63 [11 mm1 mm2] | MMX reg 1 [qword] <--packwb, signed sat-- MMX reg 2, MMX reg 1 1 


Memory to MMX Register 


OF63 [mod mm r/m] 


MMxX reg [qword] <--packwb, signed sat-- memory, MMX reg 


1 


PACKUSWB Pack Word with Unsigned Saturation 


MMxX Register 2 to MMX Register 1 


OF67 [11 mm1 mm2] 


MMxX reg 1 [qword] <--packwb, unsigned sat-- MMX reg 2, MMX reg 1 


va) 


Memory to MMX Register 


OF67 [mod mm r/m] 


MMxX reg [qword] <--packwb, unsigned sat-- memory, MMX reg 


iva) 


PADDB Packed Add Byte with Wrap-Aro 


und 


MMxX Register 2 to MMX Register 1 


OFFC [11 mm1 mm2] 


MMX reg 1 [byte] <---- MMX reg 1 [byte] + MMX reg 2 [byte] 


11 


Memory to MMX Register 


OFFC [mod mm r/m] 


MMxX reg|[byte] <---- memory [byte] + MMX reg [byte] 


1 


PADDD Packed Add Dword with Wrap-Around 


MMX Register 2 to MMX Register 1 


OFFE [11 mm1 mm2] 


MMX reg 1 [sign dword] <---- MMX reg 1 [sign dword] + MMX reg 2 [sign 
dword] 


vA) 


Memory to MMX Register 


OFFE [mod mm r/m] 


MMxX reg [sign dword] <---- memory [sign dword] + MMX reg [sign dword] 


11 


PADDSB Packed Add Signed Byte with Saturation 


MMX Register 2 to MMX Register 1 


OFEC [11 mm1 mm2] 


MMxX reg 1 [sign byte] <--sat-- MMX reg 1 [sign byte] + MMX reg 2 [sign byte] 


1/1 


Memory to Register 


OFEC [mod mm r/m] 


MMxX reg [sign byte] <--sat-- memory [sign byte] + MMX reg [sign byte] 


1 


PADDSW Packed Add Signed Word with 


Saturation 


MMX Register 2 to MMX Register 1 


OFED [11 mm1 mm2] 


MMX reg 1 [sign word] <--sat-- MMX reg 1 [sign word] + MMX reg 2 [sign 
word] 


11 


Memory to Register 


OFED [mod mm r/m] 


MMX reg [sign word] <--sat-- memory [sign word] + MMX reg [sign word] 


va) 


PADDUSB Add Unsigned Byte with Satu 


ration 


MMxX Register 2 to MMX Register 1 


OFDC [11 mm1 mm2] 


MMxX reg 1 [byte] <--sat-- MMX reg 1 [byte] + MMX reg 2 [byte] 


1 


Memory to Register 


OFDC [mod mm r/m] 


MMxX reg [byte] <--sat-- memory [byte] + MMX reg [byte] 


va) 
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Table 9-31 MMX Instruction Set Summary (cont.) 


MMX Instructions Opcode Operation and Clock Count (Latency/Throughput) 
PADDUSW Add Unsigned Word with Saturation 
MMX Register 2 to MMX Register 1 |OFDD [11 mm1 mm2]}MMX reg 1 [word] <--sat-- MMX reg 1 [word] + MMX reg 2 [word] 1 
Memory to Register OFDD [mod mm r/m] |MMX reg [word] <--sat-- memory [word] + MMX reg [word] aval 


PADDW Packed Add Word with Wrap-Around 
MMX Register 2 to MMX Register 1. |OFFD [11 mm1 mm2] | MMX reg 1 [word] <---- MMX reg 1 [word] + MMX reg 2 [word] 1 
Memory to MMX Register OFFD [mod mm r/m]_|MMX reg [word] <---- memory [word] + MMX reg [word] 1 


PAND Bitwise Logical AND 
MMX Register 2 to MMX Register 1. |OFDB [11 mm1 mm2]|}MMX reg 1 [qword] <--logic AND-- MMX reg 1 [qword], MMX reg 2 [qword] 1 
Memory to MMX Register OFDB [mod mm r/m] |MMX reg [qword] <--logic AND-- memory [qword], MMX reg [qword] 


PANDN Bitwise Logical AND NOT 


MMX Register 2 to MMX Register 1. | OFDF [11 mm1 mm2] | MMX reg 1 [qword] <--logic AND -- NOT MMX reg 1 [qword], MMX reg 2 1 
[qword] 
Memory to MMX Register OFDF [mod mm r/m]_ |MMX reg [qword] <--logic AND-- NOT MMX reg [qword], Memory [qword] 1/1 


PCMPEGB Packed Byte Compare for Equality 


MMxX Register 2 with MMX Register |OF74 [11 mm1 mm2] | MMX reg 1 [byte] <--FFh-- if MMX reg 1 [byte] = MMX reg 2 [byte] 1 
1 MMX reg 1 [byte]<--00h-- if MMX reg 1 [byte] NOT = MMX reg 2 [byte] 
Memory with MMX Register 0OF74 [mod mm r/m]_ | MMX reg [byte] <--FFh-- if memory[byte] = MMX reg [byte] 1 


MMxX reg [byte] <--00h-- if memory[byte] NOT = MMX reg [byte] 


PCMPEQD Packed Dword Compare for Equality 


MMX Register 2 with MMX Register |OF76 [11 mm1 mm2] |MMX reg 1 [dword] <--FFFF FFFFh-- if MMX reg 1 [dword] = MMX reg 2 1 
1 [dword] 
MMX reg 1 [dword]<--0000 0000h--if MMX reg 1[dword] NOT = MMX reg 2 
[dword] 
Memory with MMX Register OF76 [mod mm r/m]_|MMX reg [dword] <--FFFF FFFFh-- if memory[dword] = MMX reg [dword] 1 


MMxX reg [dword] <--0000 0000h-- if memory[dword] NOT = MMxX reg [dword] 


PCMPEQW Packed Word Compare for Equality 


MMX Register 2 with MMX Register |0F75 [11 mm1 mm2] |MMX reg 1 [word] <--FFFFh-- if MMX reg 1 [word] = MMX reg 2 [word] 1 
1 MMX reg 1 [word]<--0000h-- if MMX reg 1 [word] NOT = MMX reg 2 [word] 
Memory with MMX Register OF75 [mod mm r/m]__| MMX reg [word] <--FFFFh-- if memory[word] = MMX reg [word] 1 


MMX reg [word] <--0000h-- if memory[word] NOT = MMX reg [word] 


PCMPGTB Pack Compare Greater Than Byte 


MMX Register 2 to MMX Register 1 | OF64 [11 mm1 mm2] | MMX reg 1 [byte] <--FFh-- if MMX reg 1 [byte] > MMX reg 2 [byte] 1 
MMX reg 1 [byte]<--00h-- if MMX reg 1 [byte] NOT > MMX reg 2 [byte] 
Memory with MMX Register OF64 [mod mm r/m] | MMX reg [byte] <--FFh-- if memory[byte] > MMX reg [byte] 1 


MMxX reg [byte] <--00h-- if memory[byte] NOT > MMX reg [byte] 


PCMPGTD Pack Compare Greater Than Dword 


MMX Register 2 to MMX Register 1 | OF66 [11 mm1 mm2] |MMX reg 1 [dword] <--FFFF FFFFh-- if MMX reg 1 [dword] > MMX reg 2 1 
[dword] 
MMX reg 1 [dword]<--0000 0000h--if MMX reg 1 [dwordJNOT > MMX reg 2 
[dword] 

Memory with MMX Register OF66 [mod mm r/m]_ |MMX reg [dword] <--FFFF FFFFh-- if memory[dword] > MMX reg [dword] 1 


MMxX reg [dword] <--0000 0000h-- if memory[dword] NOT > MMX reg [dword] 
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Table 9-31 MMX Instruction Set Summary (cont.) 


MMX Instructions 


Opcode 


Operation and Clock Count (Latency/Throughput) 


PCMPGTW Pack Compare Greater Than Word 


MMX Register 2 to MMX Register 1 |OF65 [11 mm1 mm2] |MMX reg 1 [word] <--FFFFh-- if MMX reg 1 [word] > MMX reg 2 [word] 1 
MMX reg 1 [word]<--0000h-- if MMX reg 1 [word] NOT > MMX reg 2 [word] 
Memory with MMX Register OF65 [mod mm r/m] | MMX reg [word] <--FFFFh-- if memory[word] > MMX reg [word] 1 
MMX reg [word] <--0000h-- if memory[word] NOT > MMX reg [word] 
PMADDWD Packed Multiply and Add 
MMX Register 2 to MMX Register 1 | OFF5 [11 mm1 mm2] | MMX reg 1 [dword] <--add-- [dword]<---- MMX reg 1 [sign word]*MMX reg 2/1 
2[sign word] 
Memory to MMX Register OFF5 [mod mm r/m]_ |MMX reg 1 [dword] <--add-- [dword] <---- memory [sign word] * Memory [sign | 2/1 
word] 
PMULHW Packed Multiply High 
MMX Register 2 to MMX Register 1 |OFE5 [11 mm1 mm2] | MMX reg 1 [word] <--upper bits-- MMX reg 1 [sign word] * MMX reg 2 [sign 2/1 
word] 
Memory to MMX Register OFE5 [mod mm r/m]_ |MMX reg 1 [word] <--upper bits-- memory [sign word] * Memory [sign word] 2/1 
PMULLW Packed Multiply Low 
MMX Register 2 to MMX Register 1. |OFD5 [11 mm1 mm2] | MMX reg 1 [word] <--lower bits-- MMX reg 1 [sign word] * MMX reg 2 [sign 2/1 
word] 
Memory to MMX Register OFD5 [mod mm r/m]_ |MMX reg 1 [word] <--lower bits-- memory [sign word] * Memory [sign word] 2/1 
POR Bitwise OR 
MMX Register 2 to MMX Register 1 |OFEB [11 mm1 mm2] | MMX reg 1 [qword] <--logic OR-- MMX reg 1 [qword], MMX reg 2 [qword] V1 
Memory to MMX Register OFEB [mod mm r/m]_ |MMX reg [qword] <--logic OR-- MMX reg [qword], memory[qword] 1 
PSLLD Packed Shift Left Logical Dword 
MMxX Register 1 by MMX Register 2 |OFF2 [11 mm1 mm2] |MMX reg 1 [dword] <--shift left, shifting in zeroes by MMX reg 2 [dword]-- 1 
MMX Register by Memory OFF2 [mod mm r/m]_ |MMX reg [dword] <--shift left, shifting in zeroes by memory[dword]-- 1 
MMX Register by Immediate OF72 [11 110 mm] # |MMX reg [dword] <--shift left, shifting in zeroes by [im byte]-- val 
PSLLQ Packed Shift Left Logical Qword 
MMX Register 1 by MMX Register 2 | OFF3 [11 mm1 mm2] | MMX reg 1 [qword] <--shift left, shifting in zeroes by MMX reg 2 [qword]-- 1/1 
MMX Register by Memory OFF3 [mod mm r/m]_ |MMX reg [qword] <--shift left, shifting in zeroes by [qword]-- 1/1 
MMX Register by Immediate OF73 [11 110 mm] # | MMX reg [qword] <--shift left, shifting in zeroes by [im byte]-- 1/1 
PSLLW Packed Shift Left Logical Word 
MMX Register 1 by MMX Register 2 | OFF1 [11 mm1 mm2] | MMX reg 1 [word] <--shift left, shifting in zeroes by MMX reg 2 [word]-- val 
MMX Register by Memory OFF1 [mod mm r/m]_ |MMX reg [word] <--shift left, shifting in zeroes by memory[word]-- val 
MMX Register by Immediate OF71 [11 110mm] # | MMX reg [word] <--shift left, shifting in zeroes by [im byte]-- val 
PSRAD Packed Shift Right Arithmetic Dword 
MMX Register 1 by MMX Register 2 | OFE2 [11 mm1 mm2] | MMX reg 1 [dword] <--arith shift right, shifting in zeroes by MMX reg 2 [dword-} 1/1 
| 
MMxX Register by Memory OFE2 [mod mm r/m]_|MMX reg [dword] <--arith shift right, shifting in zeroes by memory[dword]-- 1/1 
MMX Register by Immediate OF72 [11 100 mm] # |MMX reg [dword] <--arith shift right, shifting in zeroes by [im byte]-- aval 
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Table 9-31 MMX Instruction Set Summary (cont.) 


MMX™ Instruction Set 


MMX Instructions 


Opcode 


Operation and Clock Count (Latency/Throughput) 


PSRAW Packed Shift Right Arithmetic Word 


MMxX Register 1 by MMX Register 2 |OFE1 [11 mm1 mm2] | MMX reg 1 [word] <--arith shift right, shifting in zeroes by MMX reg 2 [word]-- 1 
MMX Register by Memory OFE1 [mod mm r/m]_ | MMX reg [word] <--arith shift right, shifting in zeroes by memory[word--] 1 
MMX Register by Immediate OF71 [11 100 mm] # |MMX reg [word] <--arith shift right, shifting in zeroes by [im byte]-- val 
PSRLD Packed Shift Right Logical Dword 
MMX Register 1 by MMX Register 2 | OFD2 [11 mm1 mm2] | MMX reg 1 [dword] <--shift right, shifting in zeroes by MMX reg 2 [dword]-- 1 
MMxX Register by Memory OFD2 [mod mm r/m]_ |MMX reg [dword] <--shift right, shifting in zeroes by memory[dword]-- 1/1 
MMX Register by Immediate OF72 [11 010 mm] # | MMX reg [dword] <--shift right, shifting in zeroes by [im byte]-- 1/1 
PSRLQ Packed Shift Right Logical Qword 
MMX Register 1 by MMX Register 2 |OFD3 [11 mm1 mm2] | MMX reg 1 [qword] <--shift right, shifting in zeroes by MMX reg 2 [qword] 1 
MMX Register by Memory OFD3 [mod mm r/m]_ |MMX reg [qword] <--shift right, shifting in zeroes by memory[qword] 1 
MMX Register by Immediate OF73 [11 010 mm] # | MMX reg [qword] <--shift right, shifting in zeroes by [im byte] 1 
PSRLW Packed Shift Right Logical Word 
MMX Register 1 by MMX Register 2 |OFD1 [11 mm1 mm2] | MMX reg 1 [word] <--shift right, shifting in zeroes by MMX reg 2 [word] 1 
MMX Register by Memory OFD1 [mod mm r/m]_|MMX reg [word] <--shift right, shifting in zeroes by memory[word] 1/1 
MMX Register by Immediate OF71 [11 010 mm] # |MMX reg [word] <--shift right, shifting in zeroes by imm[word] 1/1 
PSUBB Subtract Byte With Wrap-Around 
MMX Register 2 to MMX Register 1 | OFF8 [11 mm1 mm2] | MMX reg 1 [byte] <---- MMX reg 1 [byte] subtract MMX reg 2 [byte] 1 
Memory to MMX Register OFF8 [mod mm r/m]_ |MMX reg [byte] <---- MMX reg [byte] subtract memory [byte] V1 
PSUBD Subtract Dword With Wrap-Around 
MMxX Register 2 to MMX Register 1 | OFFA [11 mm1 mm2] | MMX reg 1 [dword] <---- MMX reg 1 [dword] subtract MMX reg 2 [dword] iva 
Memory to MMX Register OFFA [mod mm r/m]_ |MMX reg [dword] <---- MMX reg [dword] subtract memory [dword] /1 
PSUBSB Subtract Byte Signed With Saturation 
MMX Register 2 to MMX Register 1 | OFE8 [11 mm1 mm2] | MMX reg 1 [sign byte] <--sat-- MMX reg 1 [sign byte] subtract MMX reg 2 [sign | 1/1 
byte] 
Memory to MMX Register OFE8 [mod mm r/m] |MMX reg [sign byte] <--sat-- MMX reg [sign byte] subtract memory [sign byte] | 1/1 
PSUBSW Subtract Word Signed With Saturation 
MMX Register 2 to MMX Register 1 | OFE9 [11 mm1 mm2] | MMX reg 1 [sign word] <--sat-- MMX reg 1 [sign word] subtract MMX reg 2 1 
[sign word] 
Memory to MMX Register OFE9 [mod mm r/m]_|MMX reg [sign word] <--sat-- MMX reg [sign word] subtract memory [sign 1 
word] 
PSUBUSB Subtract Byte Unsigned With Saturation 
MMX Register 2 to MMX Register 1 | OFD8 [11 mm1 mm2] | MMX reg 1 [byte] <--sat-- MMX reg 1 [byte] subtract MMX reg 2 [byte] WV 
Memory to MMX Register OFD8 [11 mm reg] MMxX reg [byte] <--sat-- MMX reg [byte] subtract memory [byte] WV 
PSUBUSW Subtract Word Unsigned With Saturation 
MMX Register 2 to MMX Register 1 | OFD9 [11 mm1 mm2] | MMX reg 1 [word] <--sat-- MMX reg 1 [word] subtract MMX reg 2 [word] 1 
Memory to MMX Register OFD9 [11 mm reg] MMxX reg [word] <--sat-- MMX reg [word] subtract memory [word] V1 
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Table 9-31 MMX Instruction Set Summary (cont.) 


MMX Instructions 


Opcode 


Operation and Clock Count (Latency/Throughput) 


PSUBW Subtract Word With Wrap-Around 


MMxX Register 2 to MMX Register 1 


OFF9 [11 mm1 mm2] 


MMxX reg 1 [word] <---- MMX reg 1 [word] subtract MMX reg 2 [word] 


va) 


Memory to MMX Register 


OFF9 [mod mm r/m] 


MMxX reg [word] <---- MMX reg [word] subtract memory [word] 


1 


PUNPCKHBW Unpack High Packed Byte, Data to Packed Words 


MMX Register 2 to MMX Register 1 


OF68 [11 mm1 mm2] 


MMxX reg 1 [byte] <--interleave-- MMX reg 1 [up byte], MMX reg 2 [up byte] 


1 


Memory to MMX Register 


OF68 [11 mm reg] 


MMxX reg [byte] <--interleave-- memory [up byte], MMX reg [up byte] 


1 


PUNPCKHDQ Unpack High Packed Dword, Data to Qword 


MMxX Register 2 to MMX Register 1 


OF6A [11 mm1 mm2] 


MMX reg 1 [dword] <--interleave-- MMX reg 1 [up dword], MMX reg 2 [up 
dword] 


vA) 


Memory to MMX Register 


OF6A [11 mm reg] 


MMxX reg [dword] <--interleave-- memory [up dword], MMX reg [up dword] 


va) 


PUNPCKHWD Unpack High Packed Word, Data to Packed Dwords 


MMX Register 2 to MMX Register 1 


OF69 [11 mm1 mm2] 


MMX reg 1 [word] <--interleave-- MMX reg 1 [up word], MMX reg 2 [up word] 


va) 


Memory to MMX Register 


OF69 [11 mm reg] 


MMxX reg [word] <--interleave-- memory [up word], MMX reg [up word] 


1 


PUNPCKLBW Unpack Low Packed Byte. 


, Data to Packed Words 


MMX Register 2 to MMX Register 1 


OF60 [11 mm1 mm2] 


MMX reg 1 [word] <--interleave-- MMX reg 1 [low byte], MMX reg 2 [low byte] 


vA) 


Memory to MMX Register 


OF60 [11 mm reg] 


MMxX reg [word] <--interleave-- memory [low byte], MMX reg [low byte] 


vA) 


PUNPCKLDQ Unpack Low Packed Dword, Data to Qword 


MMxX Register 2 to MMX Register 1 


OF62 [11 mm1 mm2] 


MMX reg 1 [word] <--interleave-- MMX reg 1 [low dword], MMX reg 2 [low 
dword] 


va) 


Memory to MMX Register 


OF62 [11 mm reg] 


MMxX reg [word] <--interleave-- memory [low dword], MMX reg [low dword] 


1 


PUNPCKLWD Unpack Low Packed Word, Data 


to Packed Dwords 


MMX Register 2 to MMX Register 1 


OF61 [11 mm1 mm2] 


MMX reg 1 [word] <--interleave-- MMX reg 1 [low word], MMX reg 2 [low word] 


va) 


Memory to MMX Register 


OF61 [11 mm reg] 


MMX reg [word] <--interleave-- memory [low word], MMX reg [low word] 


va) 


PXOR Bitwise XOR 


MMX Register 2 to MMX Register 1 


OFEF [11 mm1 mm2] 


MMxX reg 1 [qword] <--logic exclusive OR-- MMX reg 1 [qword], MMX reg 2 
[qword] 


1 


Memory to MMX Register 


OFEF [11 mm reg] 


MMxX reg [qword] <--logic exclusive OR-- memory[qword], MMX reg [qword] 


1/1 
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9.6 Cyrix Extended MMX™ Table 9-32 Cyrix Extend MMX Instruction Set 
Instruction Set Table Legend 
Cyrix has added instructions to its implementation Abbreviation Description 
of the Intel© MMX™ Architecture in order to facili- a Seanwiion 
tate writing of multimedia applications. In general, — 
these instructions allow more efficient implementa- Et fone reg Binab er binary groups! digits 
tion of multimedia algorithms, or more precision in mm One of eight 64-bit MMX registers 
computation than can be achieved using the basic reg A general purpose register 
ee OL MIN insthuchons: All a the added Inst: <--sat- If required, the resultant data is saturated 
tions follow the SIMD (single instruction, multiple to remain in the associated data range 
data) format. Many of the instructions add flexibility ee Siesta i SOUBALTG IERHILISGATIOR 
to the MMX architecture by allowing both source - 
operands of an instruction to be preserved, while [byte] Eight 8-bit bytes are processed in parallel 
the result goes to a separate register that is [word] Four 16-bit word are processed in parallel 
derived from the input. [dword] Two 32-bit double words are processed in 
: ; parallel 
Table 9-33 summarizes the Cyrix Extended MMX : 
Instructions. The abbreviations used in the table lgword) Ng Oe PIG ad WOIa IS pibceaeed 
are listed in Table 9-32. [sign xxx] The byte, word, double word or quad 
word most significant bit is a sign bit 
Configuration control register CCR7(0) at location mnomen MMX Register 1, MMX Register 2 
EBh must be set to allow the execution of the Cyrix an Meccan anon ding ( aaa 
; : mod r/m od and r/m byte encoding (page 6-6 o 
Extended MMxX instructions. this mania 
pack Source data is truncated or saturated to 
next smaller data size, then concate- 
nated. 
packdw Pack two double words from source and 
two double words from destination into 
four words in destination register. 
packwb Pack four words from source and four 
words from destination into eight bytes in 
destination 
register. 
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Table 9-33 Cyrix Extended MMX Instruction Set Summary 


MMX Instructions 


Opcode 


Operation and Clock Count 


PADDSIW Packed Add Signed Word with Saturation Using Implied Destination 


MMxX Register plus MMX Register to Implied Register 


OF51 [11 mm1 mm2] 


Memory plus MMX Register to Implied Register 


OF51 [mod mm r/m] 


Sum signed packed word from MMX register/memory ---> 
signed packed word in MMX register, saturate, and write 
result ---> implied register 


PAVEB Packed Average Byte 


MMxX Register 2 with MMX Register 1 


OF50 [11 mm1 mm2] 


Memory with MMX Register 


OF50 [mod mm r/m] 


Average packed byte from the MMX register/memory with 
packed byte in the MMX register. Result is placed in the MMX 
register. 


PDISTIB Packed Distance and Accumulate with Implied Register 


Memory, MMX Register to Implied Register 


OF54 [mod mm r/m] 


Find absolute value of difference between packed byte in 
memory and packed byte in the MMX register. Using 
unsigned saturation, accumulate with value in implied desti- 
nation register. 


PMACHRIW Packed Multiply and Accumulate with Rounding 


Memory to MMX Register 


OF5E[mod mm r/m] 


Multiply the packed word in the MMX register by the packed 
word in memory. Sum the 32-bit results pairwise. Accumulate 
the result with the packed signed word in the implied destina- 
tion register. 


PMAGW Packed Magnitude 


MMxX Register 2 to MMX Register 1 


OF52 [11 mm1 mm2] 


Memory to MMX Register 


OF52 [mod mm r/m] 


Set the destination equal ---> the packed word with the largest 
magnitude, between the packed word in the MMX regis- 
ter/memory and the MMX register. 


PMULHRIW Packed Multiply High with Rounding, Implied Destination 


MMX Register 2 to MMX Register1 


OF5D [11 mm1 mm2] 


Memory to MMX Register 


OF5D [mod mm r/m] 


Packed multiply high with rounding and store bits 30 - 15 in 
implied register. 


PMULHRW Packed Multiply High with Rounding 


MMX Register 2 to MMX Register 1 


OF59 [11 mm1 mm2] 


Memory to MMX Register 


OF59 [mod mm r/m] 


Multiply the signed packed word in the MMX register/memory 
with the signed packed word in the MMX register. Round with 
1/2 bit 15, and store bits 30 - 15 of result in the MMX register. 


PMVGEZB Packed Conditional Move If Greater Than or Equal to Zero 


Memory to MMX Register 


OF5C [mod mm r/m] 


Conditionally move packed byte from memory ---> packed 
byte in the MMX register if packed byte in implied MMX regis- 
ter is greater than or equal ---> zero. 


PMVLZB Packed Conditional Move If Less Than Zero 


Memory to MMX Register 


OF5B [mod mm r/m] 


Conditionally move packed byte from memory ---> packed 
byte in the MMX register if packed byte in implied MMX regis- 
ter is less than zero. 


PMVNZB Packed Conditional Move If Not Zero 


Memory to MMX Register 


OF5A [mod mm r/m] 


Conditionally move packed byte from memory ---> packed 
byte in the MMX register if packed byte in implied MMX regis- 
ter is not zero. 


PMVZB Packed Conditional Move If Zero 


Memory to MMX Register 


OF58 [mod mm r/m] 


Conditionally move packed byte from memory ---> packed 
byte in the MMX register if packed byte in implied the MMX 
register is zero. 


PSUBSIW Packed Subtracted with Saturation Using Implied Destination 


MMX Register 2 to MMX Register 1 


OF55 [11 mm1 mm2] 


Memory to MMX Register 


OF55 [mod mm r/m] 


Subtract signed packed word in the MMX register/memory 
from signed packed word in the MMX register, saturate, and 
write result ---> implied register. 


GXm_db_v2.0 


Cyrix Corporation Confidential 


Page 273 


Cyrix 
Media(QX rex 1 
e te (Am Cyrix Extended MMX™ Instruction Set 


M M X"*- E N H AN € E D 


Page 274 Cyrix Corporation Confidential GXm_db_v2.0 


MediaGX™ MMX™-Enhanced Processor 
Integrated x86 Solution with MMX™ Support 


Cyrix 
Media(QX 


MM X" = N H A N € E D | 
Ss 


Appendix A Support Documentation 


A.1 Order Information 


Cyrix National Part Core Temperature 
Part Number Number (NSID) Frequency (MHz) (Degree C) Package 
GM200P 30040-23 200 70 PGA 
GM200P-85 30041-23 200 85 PGA 
GM200B-85 30141-23 200 85 BGA 
GM233P 30050-33 233 70 PGA 
GM233P-85 30054-33 233 85 PGA 
GM233B-85 30151-33 233 85 BGA 
GM266P 30070-53 266 70 PGA 
GM266P-85 30071-53 266 85 PGA 
GM266B-85 30171-53 266 85 BGA 
GM300P 30080-63 300 70 PGA 
GM300P-85 30081-63 300 85 PGA 
GM300B-85 TBA 300 85 BGA 
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A.2._ Data Book Revision History 


This document is a report of the revision/creation deletions, parameter corrections, etc.) are 
process of the data book for the MediaGX MMX™- recorded in the tables below. 
Enhanced Processor. Any revisions (i.e., additions, 


Table A-1 Revision History 


Revision # 
(PDF Date) Revisions / Comments 
0.0 (2/5/98) Creation phase 
0.1 (2/25/98) Creation phase continues - added functional description. 
0.2 (3/24/98) Creation phase continues - added 233MHz parameters. 
0.3 (4/22/98) Creation phase continues - added 266MHz numbers. 
1.0 (8/13/98) All sections complete - added 300MHz numbers, added Index. 
2.0 (10/29/98) Major change is new values for 352 BGA Mechanical. See Table A-2 for complete edits. 


Table A-2 Edits to Current Revision 


Section Description 


Introduction ¢ Changed 266MHz reference to 300MH2z, page iii. 
1.0 - Overview Changed 266MHz references to 300MHz, pages 1 and 3. 


2.0 - Signal Changed GXm reference in CLMODE[2:0] signal description to MediaGX MMX-Enhanced 
Definitions processor. 


SDCLKO was incorrectly called out as AE4 in "BGA Pin No." column on page 29. 
Changed to AF4. 


3.0 - Processor In Table 3-7 "CR4-CRO Bit Definitions", CR4 bit 2 was incorrectly called out at TSD. Changed 
Programming to TSC. 


Corrected all SMI_LOCK and MAPEN index/register cross-references in Table 3-11 "Configu- 
ration Registers". 


Changed GXm references in Table 3-11 "Configuration Registers" to MediaGX MMX- 
Enhanced processor. 


In Table 3-16 "TR5-TR3 Bit Definitions", TR4 Register was not showing RSVD bits [2:0]. 
Added to table. 
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Table A-2 Edits to Current Revision (cont.) 


Section Description 


4.0 - Integrated | + In Table 4-6 "Display Driver Instructions" corrected Opcode for BBO_RESET from 0F72 to 
Functions OF3A and BB1_RESET from 0F73 to OF3B. 


* Changed text — Section 4.3.3 “SDRAM Commands” on page 119, third paragraph under 
"MRS". Was — The memory controller only supports a burst length of two and burst type of 
sequential. Now — The memory controller only supports a burst length of two and burst type 
of interleave. 


* Changed MA3 parameter in Table 4-14 "Address Line Programming during MRS Cycles" 
from ’0’ to 1’. 

* Modified Index 41h[1] decode for a setting of 1 in Table 4-41 "PCI Configuration Registers". 
Replaced the words "set by CFG Index OCh[7:0]" with "which is 16 bytes." 


8.0 - Package « New 352 BGA - modified dimensions and callouts in Figure 8-1 "352-Terminal BGA Mechan- 
Specifications ical Package Outline". Now also includes coplanarity value. 


« Removed legend from inside Figure 8-2 "320-Pin SPGA Mechanical Package Outline" and 
created new table - Table 8-3 "Mechanical Package Outline Legend". 


GXm_db_v2.0 Cyrix Corporation Confidential Page 277 


‘Media(OX 


M M X"*- E N H AN € E D 


Page 278 Cyrix Corporation Confidential GXm_db_v2.0 


A 


Absolute Maximum Ratings 

AC Characteristics 

Accessing 

Address Spaces 
Directory Table Entry (DTE) 
DTE Cache 
/O Address Space 
Memory Address Space 
Memory Addressing Modes 
Offset Mechanism 
Page Frame Offset (PFO) 
Page Table Entry (PTE) 
Paging Mechanism 
Translation Look-Aside Buffer 

Address Translation 

Application Register Set 


BGA Ball Assignments by Ball Number 
BGA Ball Assignments by Pin Name 
BGA Ball Assignments Diagram 


Cc 


Cache 
BBO_BASE 
BBO_POINTER 
BB1_BASE 
BB1_POINTER 
GCR register (Index B8h) 
L1 cache 
scratchpad memory 
Write-back caching 
Cache Controller 
Cache Disable, bit 30 
Cache Test Operations 
call gate 
Current Privilege Level 
Descriptor Privilege Level 
Descriptor Privilege Level in Destination 
Descriptors Bit Definitions 
Segment Selector Field 
CCR1 
Enable SMM Pins 
System Management Memory Access 
CCR1 Configuration Control Register 1 Index C1h 
CCR2 
Enable Suspend Pins 
Lock NW Bit 
Suspend on HALT 
Write-Through Region 1 
CCR2 Configuration Control Register 2 Index C2h 
CCR3 
Load/Store Serialize 1 GByte to 2 GBytes 
Load/Store Serialize 2 GBytes to 3 GBytes 
Load/Store Serialize 3 GBytes to 4 GBytes 
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107 
107 
107 
107 
107 
107 
107 
107 
107 
107 


76 


Map Enable 
NMI Enable 
SMM Register Lock 
CCR3 Configuration Control Register 3 Index C3h 
CCR4 
Directory Table Entry Cache 
Enable CPUID Instruction 
I/O Recovery Time 
Memory Read Bypassing 
SMI Nest 
CCR4 Configuration Control Register 4 Index E8h 
CCR7 
Cyrix Extended MMX Instructions Enable 
NMI Enabl 
CCR7 Configuration Control Register 7 Index EBh 
CKE 
Clock Enable Suspend 
Clock Mode 
Configuration Register Map 
Control Registers 
Device ID Registers 
Graphics/VGA Related Registers 
SMM Base Header Address Registers 
Configuration Register Summary 
Conforming Code Segments 
Control Transfer 
CPU_READ 
CPU_READ/WRITE 
EAX instructions 
EBX instructions 
CPU_WRITE 
CPUID Instruction 
EAX = 0000 0000h 
EAX = 0000 0001h 
EAX = 0000 0002h 
EAX = 8000 0000h 
EAX = 8000 0001h 
EAX = 8000 0002h 
EAX = 8000 0003h 
EAX = 8000 0004h 
EAX = 8000 0005h 
CPUID Levels 
CPUID Levels, Extended 
CRO Register 
Alignment Check Mask 
Cache Disable 
Emulate Processor Extension 
Monitor Processor Extension 
Not Write-Through 
Numerics Exception 
Paging Enable Bit 
Protected Mode Enable 
Task Switched 
Write Protect 
CR2 Register 
Page Fault Linear Address 
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53 
53 


54 
54 
54 
54 
54 
54 


54 
54 
54 
29 
29 
21 
51 
51 
51 
51 
51 
50 
76 
99 
111 


111 
111 
111 
240 
241 
241 
242 
243 
243 
244 
244 
244 
244 
240 
243 


279 


ia( OK XX 


M M X"- E N H AN € E 


CR3 Register 48 DC_BUF_SIZE (8328h-832Bh) 154 
Page Directory Base Register 48 DC_CB_ST_OFFSET (8314h-8317h) 154 
CR4 Register 48 DC_CFIFO_DIAG (837Ch-837Fh) 156 
Time Stamp Counter Instruction 48 DC_CURS_ST_OFFSET (8318h-831Bh) 154 
D DC_CURSOR_COLOR (83680h-8363h) 155 
> DC_CURSOR_X (8350h-8353h) 155 
DC Characteristics 217 DC_CURSOR_Y (8358h-835Bh) 155 
Descriptor Bit Structure 73 DC_DFIFO_DIAG (8378h-837Bh) 156 
Descriptor Types 99 DC_FB_ST_OFFSET (8310h-8313h) 154 
Descriptors DC_FP_H_TIMING (833Ch-833Fh) 155 
Gate 73 DC_FP_V_TIMING (834Ch-834Fh) 155 
gate 76 DC_GENERAL_CFG (8304h-8307h) 154 
Interrupt 73 DC_H_TIMING_1 (8330h-8333h) 155 
_Task 73 DC_H_TIMING_2 (8334h-8337h) 155 
Device Select 26 DC_H_TIMING_3 (8338h-833Bh) 155 
DEVSEL 26 DC_LINE_DELTA (8324h-8327h) 154 
DIMM 127 DC_OUTPUT_CFG (830Ch-830Fh) 154 
DIRO DC_PAL_ADDRESS (8370h-8373h) 156 
Device ID 56 DC_PAL_DATA (8374h-8377h) 156 
DIRO Device Identification Register 0 Index FEh 56 DC_SS_LINE_CMP (835Ch-835Fh) 155 
DIR1 DC_TIMING_CFG (8308h-830Bh) 154 
Device Identification Revision 56 DC_UNLOCK (8300h-8303h) 154 
DIR1 Device Identification Register 1 Index FFh 56 DC_V_LINE_CNT (8354h-8357h) 155 
Directory Table Entry 81 DC_V_TIMING_1 (8340h-8343h) 155 
Display Controller 145-177 DC_V_TIMING_2 (8344h-8247h) 155 
Buffer Organization 152 DC_V_TIMING_3 (8348h-834Bh) 155 
CODEC hardware 145 DC_VID_ST_OFFSET (8320h-8323h) 154 
Compression Logic 146 Memory Organization Registers 164 
Compression Technology 146 Display Driver 
CRT Display Modes 151 BBO RESET 110 
CRT RAMDAC Data Bus Formats 150 BB1 RESET 110 
Cursor Pattern Memory 153 CPU READ 110 
DC Memory Organization 152 CPU WRITE 110 
DC_CURSOR_COLOR Register Scratchpad 110 
(BX_BASE+8360h) 147 Display Driver Instructions 110 
Display Modes 148 Bn 58 
Display Timing 148 BS 58 
Dither/Frame-Rate Modulation (FRM) 148 BT 58 
Graphics Memory Map 152 DR7 and DR6 Bit Definitions 58 
Hardware Cursor 147 DR7 Register 58 
Memory Management 146 GD 58 
Motion Video Acceleration Support 147 Gn 58 
Output Ports 148 LENn 58 
Pixel Arrangement Within a DWORD 152 Ln 58 
RAMDAC 145 R/Wn 58 
TFT 148 DRAM Address Conversion 127 
TFT LCD flat panel 145 
TFT Panel Data Bus Formats 150 E 
TFT Panel Display Modes 149 EBP register 43 
VESA-compatible 148 EFLAGS Register 45 
VGA Display Support 153 Alignment Check Enable (AM) 45 
Display Controller Block Diagram 145 Auxiliary Carry Flag 45 
Display Controller Registers 154 Carry Flag 45 
Configuration and Status Registers 157 CPUID instruction 45 
DC_BORDER_COLOR (8368h-836Bh) 155 Direction Flag (DF) 45 
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I/O Privilege Level (IOPL) 
Identification Bit 
Interrupt Enable 
Nested Task (NT) 
Resume Flag (RF) 
Sign Flag 
Trap Enable Flag 
Virtual 8086 Mode (VM) 
EFLAGS register, bit 9 
EGA 
Electrical Connections 
NC-Designated Pins 
Power/Ground Connections 
Pull-Up/Pull-Down Resisters 
Unused Input Pins 
Electrical Specifications 
Absolute Maximum Ratings 
AC Characteristics 
Clock Signals 
DC Characteristics Table 
DCLK Timing 
Graphics Port Timing 
JTAG AC Specification 
JTAG Test Timings 
Output Valid Timing 
Part Numbers 
PCI Interface Signals 
Recommended Operating Conditions 
SDRAM Interface Signals 
Setup and Hold Timings 
SYSCLK Timing 
System Signals 
TCK Timing and Measurement Points 
Video Interface Signals 
Video Port Timing 
Exceptions 
Abort 
Fault 
Trap 
Extended MMxX Instruction Set 
Extended MMX™ Instruction Set 
Configuration Control Rregister 
Legend 


F 


Fields - index 

Fields - mod and r/m 

Fields - sreg3 

Fields - ss 

floating point error 

FPU 
Mode Control Register 
Register Set 
Status Register 
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Tag Word Register 

FPU Instruction Set 
Summary Notes 

FPU Mode Control Register 
Denormalized-operand error exception bit 
Divide-by-zero exception bit 
Invalid-operation exception bit 
Overflow error exception bit 
Precision Control Bits 
Precision error exception bit 
Rounding Control Bits 

FPU Operations 

FPU Registers 

FPU Status Register 
Condition code bit 3 
Condition code bits 
Copy of ES bit 
Denormalized-operand error exception bit 
Divide-by-zero exception bit 
Error indicator 
Invalid operation exception bit 
Overflow error exception bit 
Precision error exception bit 
Stack Full 
Top-of-Stack 
Underflow error exception bit 

FPU Tag Word Register (TAG7:0] 

frame buffer 


G 


Gates 

General Purpose Registers 

Global Descriptor Table Register (GDTR) 

Grant Lines 

Graphics Memory (GX_BASE+800000h) 

Graphics Pipeline 
BitBLT/vector engine 
Color Patterns 
Diagonal Error Register (108h-810Bh) 
Dither Patterns 
Error Register (8104-8107h) 
GP_BLT_MODE 
GP_BLT_MODE (8208h-820Bh) 
GP_BLT_STATUS (820Ch-820Fh) 
GP_DST/START_Y/XCOOR (8100h-8103h) 
GP_DST_XCOOR 
GP_DST_YCOOR 
GP_INIT_ERROR 
GP_PAT_COLOR_0 register 
GP_PAT_COLOR_1 (GX_BASE+81 12h) 
GP_PAT_COLOR_A (8110h) 
GP_PAT_COLOR_B (8114h) 
GP_PAT_DATA (8120h-812Fh) 
GP_RASTER_MODE (8200h-8203h) 
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GP_RASTER_MODE (GX_BASE+ 8200h) 137 Interleaving 127 
GP_RASTER_MODE Bit Patterns 138 Internal Bus Interface 112-115 
GP_SRC_COLOR (810Ch-810Fh) 139 Internal Bus Interface Unit 
GP_SRC_COLOR_0 (GX_BASE+810Ch) 138 640KB to 1MB 112 
GP_SRC_YCOOR 136 C-Bus 112 
GP_VECTOR_MODE (8204h-8207h) 140 FPU Error Support 112 
GP_VGA_BASE (8210h-8213h) 140 Graphics 112 
GP_VGA_LATCH (8214h-821 7h) 140 IRQ13 112 
GP_VGA_READ (8200h-8203h) 139 L1 cache 112 
GP_VGA_WRITE (8140h-8143h) 139 Processor Core 112 
Master/Slave Registers 136 Region Control Field Bit Definitions 115 
Monochrome Patterns 137 Registers (GX_BASE+8000h) 113 
Pattern Generation 136 SMI Interrupts 112 

graphics pipeline 190, 191 VGA Access 112 

Graphics Pipeline Block Diagram 135 X-Bus 112 

H Internal Bus Interface Unit Diagram 103 

Internal Bus Interface Unit Registers 

HALT 23 BC_DRAM_TOP (8000h-8003h) 113 

High Order Interleaving 127 BC_XMAP_1 (8004h-8007h) 113 

l BC_XMAP_2 (8008h-800Bh) 113 

/O Address Space 65 BC_XMAP_3 (800Ch-800Fh) 113 

Initialization 39 Internal Test Signals 

Initialization, CPU 39 Utes ee 

Initiator Ready Raw Clock 33 
IRDY 25 SDRAM Test Outputs 33 
TRDY 25 Test 33 

Instruction Fields 234 Test Clock 33 

Instruction Set i Test Data Input 33 
eee Field Encoding 236 TestDaiOupet : = 
Index Field 239 Thermal Diode Negative (TDN) 33 
Memory Addressing 237 Thermal Diode Positive (TDP) 33 
mod base Field Encoding table 239 Interrupt ee oer at 
mod r/m Field Encoding 237 Interrupt and Exception Priorities 85 
Opcode 235 Interrupt Descriptor Table 84 
prefix bytes 235 Interrupt Request Level 13 22 
reg Field 238 Interrupts 82 
s-i-b Byte 239 INTE. 83 
s-i-b present 239 Nhl 82 
sreg? field 238 Real Mode Error Codes 86 
sreg3 238 Real Mode, Exceptions 86 
ss Field 239 mie P = 
w Field Operand Size 235 

instruction set 233 INTE 45, 82, 85, 94, 96, 97 

Instruction Set Format Table 234 invalid opcode a1 

Instruction Set Formats 233 IRE T instruction 45 

Instruction Set Overview 41 L 

Instructions Legacy VGA 189 
Bit Test Instructions e Local Descriptor Table Register (LDTR) 72 
Exchange Instructions 41 LOCK 26 
One-operand Arithmetic and Logical 41 Lock Prefix 7 
Two-operand Arithmetic and Logical 41 Low Order Interleaving 127 

Instuction Prefix Summary 235 

Integrated Functions 103 M 

Integrated Functions Programming Interface 104 MediaGX™ Virtual VGA 193 
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Memory Address Space 
Memory Addressing 
Paging Mechanism 
Memory Addressing Modes 
Memory Controller 
Auto LOI 
1 DIMM Bank 
2 DIMM Banks 
Block Diagram 
DRAM Address Conversion 
DRAM Configuration 
Graphics Pipeline 
Memory Array Configuration 
Memory Cycles 
Memory Organization 
Non-Auto LOI 
1 DIMM Bank 
2 DIMM Banks 
Page Miss 
Processor Interface 
SDRAM 
SDRAM Commands 
ACT 
MRS 
PRE 
READ 
WRT 
SDRAM Initialization Sequence 
SDRAM Interface 
SDRAM Interface Clocking 
CAS latency 
SDRAM Read Cycle 
SDRAM Refresh Cycle 
Precharge Command 
SDRAM Write Cycle 
SHFTSDCLK 
X-Bus 
Memory Controller Interface Signals 
Bank Address Bits 
Chip Selects 
Clock Enable 
Column Address Strobe 
Data Mask Control Bits 
Memory Address Bus 
Row Address Strobe 
SDRAM Clocks 
Write Enable 
Memory Controller Register 
MC_BANK_CFG (8408h-840Bh) 
MC_DR_ACC (841 Ch-841 Fh) 
MC_DR_ADD (8418h-841Bh) 
MC_GBASE_ADD (8414h-8417h) 
MC_MEM_CNTRL1 (8400h-8403h) 
MC_MEM_CNTRL2 (8404h-8407h) 
MC_SYNC_TIM1 (840Ch-840Fh) 
Memory Data Bus 
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121 
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Multiplexed Address 
PCI pins 24 
Multiplexed Command 
Configuration Read 24 
Configuration Write 24 
Dual Address Cycle 24 
Memory Read Line 24 
Memory Read Multiple 24 
Memory Write and Invalidate 24 
Special Cycle 24 
Multiplexed Command and Byte Enables 
Interrupt Acknowledge 24 
Multitasking 77 
N 
NMI 53, 82, 84, 85, 87, 94, 96, 97 
notebook computers 201 
O 
Overflow Flag 45 
Pp 
Package Outlines 229 
Package Specifications 227 
Page Table Entry 81 
palette lookup 191 
PCI Arbitration 187 
PCI Configuration Registers 
Access Format 180 
Bus 179 
Cache Line Size (OCh) 180 
Class Code (09h-0Bh) 180 
CONFIG ENABLE 179 
CONFIG_DATA OCFCh-OCFFh 179 
Device 179 
Device Identification (02h-03h) 180 
Device Status (O6h-07h) 180 
Latency Timer (ODh) 180 
PCI Arbitration Control 1 (43h) 180 
PCI Arbitration Control 2 (44h) 180 
PCI Command (04h-05h) 180 
PCI Control Function 1 (40h) 180 
PCI Control Function 2 (41h) 180 
Register 179 
Revision Identification (08h) 180 
Translation Type Bits [1:0] 179 
Vendor Identification (0Oh-01h) 180 
PCI Configuration Registers OCF8h-OCFBh 179 
PCI Controller 178 
CONFIG_ADDRESS 178 
Configuration Cycles 178 
PCI Arbiter 178 
Space Control Registers 179 
Special Cycles 178 
X-Bus PCI Master 178 
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X-Bus PCI Slave 178 Voltage Detect(VOLDET) 32 
PCI Cycles 185 Privilege Level Transfers 98 
PCI Halt Command 188 Privilege Levels (CPL, DPL and RPL) 97 
PCI Interface Signals Privilege Levels (I/O) 98 
Frame 25 Processor Core Instruction Set 245 
Initiator Ready 25 Clock Counts 245 
Lock Operation 26 Flags 245 
Multiplexed Address and Data 24 Legend 245 
Multiplexed Command and Byte Enables 24 Opcodes 245 
Parity 25 Processor Initialization 39 
Parity Error 27 Programming Interface 39 
Request Lines 27 Protected Mode, Initialization and Transition 99 
Target Ready 25 Protection 97 
Target Stop 26 Current Privilege Level (CPL) 97 
PCI Local Bus Specification 185 Descriptor Privilege Level (DPL) 97 
PCI Read Transactions 185 Requested Privilege Level (RPL) 97 
PCI Special Cycle Command 188 Protection - V86 Mode 100 
PCI Write Transactions 186 R 
PCR Performance Control Register Index 20h 54 
PERR 27 Recommended Operating Conditions 216 
Pixel Arrangement Within a DWORD 152 Register Controls 40 
Pointer and Index Registers Register Sets 42 
ECX Counter 43 Application 
EDI Destination Pointer 43 Flags Register 43 
ESI Source Pointer 43 General Purpose Register 43 
ESP Register 43 Instruction Pointer Register 43 
PUSH and POP Instructions 43 Segment Registers 43 
Power and Ground Connections and Decoupling 213 Flags Register 45 
Power Management 201 General Purpose 43 
3-Volt Suspend Mode 203 Data Registers ' 43 
Advanced Power Management (APM) 201 Pointer and Index Registers 43 
CPU Suspend Command Registers 202 Instruction Pointer 44 
Initiating Suspend with HALT 205 Selection Rules 44 
Initiating Suspend with SUSP 204 Model Specific Register 42 
Processor Serial Bus 208 _System Register Set 42, 46 
Responding to a PCI Access During Suspend Mode 206 Registers 
Serial Packet Transmission 208 Application Register 43 
Stopping the Input Clock 207 Model Specific Register 64 
Suspend Mode and Bus Cycles 204 REQ 27 
Suspend Modulation 202 RESET : 39 
Power Management Registers 209 ROP (raster operation) 191 
PM_BASE (FFFF FF6Ch) 209 Row Address Strobe 
PM_CNTRL_CSTP (8508h-850Bh) 209 CAS 28 
PM_CNTRL_TEN (8504h-8507h) 209 CKE 28 
PM_MASK (FFFF FF7Ch) 209 RAS 28 
PM_SER_PACK (850Ch-850Fh) 209 RASA 28 
PM_STAT_SMI (8500h-8503h) 209 RASB 28 
Power Planes 36—37 WE 28 
Power, Ground, No Connect S 
Ground (VSS) 32 Scratchpad 
Power, Ground, No Connect Signals 2KB configurations 109 
Ground (VSS) 32 3KB configurations 109 
No Connect (NC) 32 SMM information 109 
Power Connect (VCC2) 32 Scratchpad RAM 109 
Power Connect (VCC3) 32 
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SDCLK_IN 29 NMI 27 
SDCLK_OUT 29 System Interface Signals 
Segment Register Selection Rules 44 Interrupt Request 22 
Segment Registers 44 Reset 22 
Serial Packet Serial Packet 23 
CX5520 23 Suspend Acknowledge 23 
VSA 23 Suspend Request 23 
Shutdown and Halt 97 System Clock 21 
Signal Definitions 9—20 System Management Interrupt 22 
Signal Descriptions 21 System Management Interrupt (SMI#) 189 
Cyrix Internal Test and Measurement Signals 33 System Register Set 46 
Memory Controller Interface Signals 28—29 System Register Sets 
PCI Interface Signals 24-27 Cache Test Registers 61 
Power, Ground and No Connect Signals 32 Configuration Registers 50 
System Interface Signals 21-23 Debug Registers 50 
Video Interface Signals 30-31 Gate Descriptors 76 
Signals - INTR 83 Task Register 76, 77 
Signals - NMI 82 System Registers 46—63 
Signals - SMM 83 Configuration Registers 50 
SIZE Control Registers 47 
SMM Region Size Bits 55 Debug Registers 57 
Skip Counts 106 Model Specific Register (MSR) 64 
SMAR Segment Descriptor Table Registers 72 
SMM Address Region Bits 55 Test Registers 59 
SMAR SMM Address Region Register T 
Indices CDh, CEH, CFh 55 
SMHR Task Gate Descriptors 77 
SMM Header Address 55 Task Register (TR) 77 
SMHR SMI Header Address Indices BOh, B1h, B2h, B8h 55 Task State Segments 77 
SMI Thermal Characteristics 227 
Configuration Registers 89 TR3 Register 62 
Generation 93 Cache Data 62 
SMI# pin 82, 84, 87, 89, 202 TR4 Register 62 
SMM 87 Dirty Bits 62 
CPU States 96 LRU Bits 62 
Instructions 92 Upper Tag Address 62 
Memory Space 93 Valid Bit 62 
Memory Space Header 90 TRS Register 62 
Operation 88 Control Bits 62 
SMI Enhancements 88 Line Selection 62 
SMI Events 89 TR6 Register 60 
SMI Nested States 95 Command Bit _ 60 
SMI Nesting 94 Dirty Attribute Bit 60 
SMI Service Routine Execution 94 Linear Address 60 
SMI# Pin 89 Valid Bit 60 
Suspend Mode 96 TR7 Register 60 
Suspend Mode CPU States 96 LRU Bits 60 
SMM Memory Space Header Description 91 Physical Address 60 
SPGA Pin Assignments by Pin Number 17 PLBit 60 
SPGA Pin Assignments by Signal Name 19 Set Selection 60 
SPGA Pin Assignments Diagram 16 Translation Lookaside Buffer 107 
STOP 26 V 
Subsystem Signal Connections 34-35 V86 Mode 
Suspend 64, 96, 97 Entering and Leaving 100 
Suspend Mode 23 Interrupt Handling 100 
System Error 27 
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Memory Addressing 
VESA 
VGA Address Mapping 
MapMask register 
Miscellaneous Output register 
VGA Configuration Registers 
VGA Control Register (B9h) 
VGA Mask Register (BAh-BDh) 
VGA Front End 
VGA function 
attribute controller 
CRT controller 
frame buffer 
general registers 
graphics controller 
sequencer 
VGA Hardware 
SMI Generation 
VGA Address Generator 
VGA Memory 
VGA Range Detection 
VGA Sequencer 
VGA Write/Read Path 
VGA Memory 
frame buffer address 
host address 
refresh address 
VGA Memory Addresses 
VGA Memory Organization 
VGA Range Detection 
VGA Sequencer 
VGA Video BIOS 
VGA Video Refresh 
All Points Addressable mode (APA) 
attribute controller (ATTR) 
CGA mode 
Chain 4 mode 
ClockSelect field 
ColorPlaneEnable register 
CRT controller (CRTC) 
LineCompare register 
Miscellaneous Output register 
ShiftRegister field 
VGA Write/Read Path 
Video Data Bus 
VID_CLK 
Video Interface Signals 
CRT Horizontal Sync 
CRT Vertical Sync 
Display Enable 
Dotclock 
Flat Panel Horizontal Sync 
Flat Panel Vertical Sync 
Graphics Pixel Data Bus 
Pixel Port Clock 
Video Clock 
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Video Data Bus 
Video Ready 
Video Valid 
video refresh 
Virtual 8086 Mode (V86) 
Virtual Subsystem Architecture (VSA) 
Virtual VGA 
ColorCompare register 
ColorDon’tCare register 
Datapath Elements 
read mode unit 
write-mode unit 
DataRotate register 
ReadMapSelect register 
SetReset register 
SMI Generation 
Virtual VGA Register Descriptions 
Virtual VGA Registers 
GP_VGA_WRITE (8140h-8143h) 
Virtual VGA Registers 
GP_VGA_BASE VGA (8210h-8213h) 
GP_VGA_LATCH (8214h-821 7h) 
GP_VGA_READ (8144h-8147h) 
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